ANALYTIC SUBORDINATION THEORY OF
OPERATOR-VALUED FREE ADDITIVE CONVOLUTION AND
THE SOLUTION OF A GENERAL RANDOM MATRIX PROBLEM

SERBAN T. BELINSCHI, TOBIAS MAI, AND ROLAND SPEICHER 

Abstract. We develop an analytic theory of operator-valued additive free
convolution in terms of subordination functions. In contrast to earlier
investigations our functions are not just given by power series expansions, but are
defined as Fréchet analytic functions in all of the operator upper half plane.
Furthermore, we do not have to assume that our state is tracial. Combining
this new analytic theory of operator-valued free convolution with Anderson's
selfadjoint version of the linearization trick we are able to provide a
solution to the following general random matrix problem: Let
$x_1^{(N)}, \dots, x_n^{(N)}$ be selfadjoint $N \times N$ random matrices which are,
for $N \to \infty$, asymptotically free. Consider a selfadjoint polynomial $p$ in $n$
non-commuting variables and let $P^{(N)}$ be the element
$P^{(N)} = p(x_1^{(N)}, \dots, x_n^{(N)})$. How can we calculate the
asymptotic eigenvalue distribution of $P^{(N)}$ out of the asymptotic eigenvalue
distributions of $x_1^{(N)}, \dots, x_n^{(N)}$?



1. Introduction 

Free probability theory [?5J HH] originated from questions about the structure 
of von Neumann algebras related to free groups, but has evolved into a subject with 
links to many other, a priori quite unrelated, fields: most notably random matrices, 
but also combinatorics, representation theory of large groups, mathematical physics,
and applied fields such as financial correlations and wireless communications.

The central notion in the theory is the one of "free independence" or "freeness" . 
This replaces in the context of non-commuting variables the classical notion of 
independence. Whereas the latter is based on tensor products, free independence 
is modelled according to free products. 

The relation between free probability theory and random matrices relies on the 
fundamental insight of Voiculescu [40] that typical independent random matrices
become asymptotically free in the large N limit. This triggered a surge of new
results on operator algebras as well as on random matrices. Based on this, free
probability techniques have become prominent in wireless communications [35] or
in computational methods for random matrices [30].

In this paper we will provide a solution to the following basic problem in free 
probability theory. 

Problem 1. Let $x_1, \dots, x_n$ be selfadjoint elements which are freely independent.
Consider a selfadjoint polynomial $p$ in $n$ non-commuting variables and let $P$ be the
element $P = p(x_1, \dots, x_n)$. How can we calculate the distribution of $P$ out of the
distributions of $x_1, \dots, x_n$?

By Voiculescu's asymptotic freeness results typical random matrices (like
deterministic and independent Wigner, Wishart, or Haar unitary matrices) become
asymptotically free in the large N limit. This means that the asymptotic eigenvalue 
distribution of a random matrix, which is given as some polynomial in such asymp- 
totically free random matrices, is given by the distribution of the corresponding 
polynomial in free variables. Thus, if we can deal with polynomials in free random 
variables we can deal with any random matrix of this type. Hence our result yields 
as a direct consequence also a solution to the following random matrix problem. 

Problem 2. Let $x_1^{(N)}, \dots, x_n^{(N)}$ be selfadjoint $N \times N$ random matrices which
are, for $N \to \infty$, asymptotically free. Consider a selfadjoint polynomial $p$ in $n$
non-commuting variables and let $P^{(N)}$ be the element $P^{(N)} = p(x_1^{(N)}, \dots, x_n^{(N)})$.
How can we calculate the asymptotic eigenvalue distribution of $P^{(N)}$ out of the
asymptotic eigenvalue distributions of $x_1^{(N)}, \dots, x_n^{(N)}$?

For both Problems 1 and 2 there is a long list of contributions which provide
solutions for special choices of the polynomial $p$. In the context of free
probability, Voiculescu solved in [38] and [39] Problem 1 for the cases of
$p(x, y) = x + y$ and $p(x, y) = x y^2 x$ (corresponding to the additive and
multiplicative free convolution) with the introduction of the $R$- and
$S$-transform, respectively. Nica and Speicher could give in [27] a solution for
the problem of the free commutator, $p(x, y) = i(xy - yx)$. In the random matrix
context, Problem 2 was addressed for various polynomials - and usually, also for
specific choices of the distributions of the $x_i^{(N)}$ - in work of:
Marchenko-Pastur [24], Girko [TH], Bai and Silverstein [12"]; Tulino and Verdú [35];
Müller [55]; Moustakas and Simon [35]; Hachem, Loubaton, and Najim [2T]; Couillet
and Debbah [TS]; and many more. For a more extensive list of contributions in this
context we refer to the books [12], [35], [13]. Some of those situations were also
treated by operator-valued free probability tools, see in particular [34], [5]. The
general case for the anti-commutator in the random matrix context,
$p(x, y) = xy + yx$, was treated by Vasilchuk [36].

All those investigations were specific to the polynomial under consideration, and up
to now there has not existed a master algorithm which would work for all
polynomials. We will provide here such a general algorithm. Our solution will rely on a
combination of the so-called linearization trick with new advances on the analytic 
theory of operator-valued convolutions; from a technical point of view the latter is 
the main contribution of the present work. 

The general philosophy of the linearization trick is that a complicated
scalar-valued problem can be transformed into a simpler operator-valued problem. This
idea can be traced back to the early papers [42] of Voiculescu, but has become
quite prominent in the seminal paper [20] of Haagerup and Thorbjørnsen. More
precisely, the linearization trick from [20] says that for understanding properties of a
polynomial $p(x_1, \dots, x_n)$ in some (in general, non-commuting) variables
$x_1, \dots, x_n$ it suffices to consider a linear polynomial
$L := a_1 \otimes x_1 + \cdots + a_n \otimes x_n$. The price
to pay for this transition from a general to a linear polynomial is that the linear
polynomial $L$ has operator-valued coefficients (i.e., the $a_1, \dots, a_n$ are matrices, of
suitably chosen size, depending on the polynomial $p$, but not on the variables
$x_1, \dots, x_n$). We will be mainly interested in selfadjoint polynomials $p$. The original
approach of Haagerup and Thorbjørnsen has the drawback that it does not preserve
selfadjointness. However, recently Anderson [2] refined the linearization trick in
that respect, i.e., he showed that the coefficients $a_i$ can actually be chosen in such
a way that $L$ is also selfadjoint.
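
To make the mechanism concrete, here is a minimal numerical sketch of the Schur-complement identity that underlies linearization, for the anti-commutator $p(x, y) = xy + yx$. It is an illustration only: the particular selfadjoint linearization matrix $L$ used below, the matrix with rows $(0, x, y)$, $(x, 0, -1)$, $(y, -1, 0)$, is an assumption made for this example (it also carries a constant term, which the schematic formula for $L$ above omits), and the $2 \times 2$ matrices `x`, `y` are arbitrary stand-ins for non-commuting variables. The check is that the upper left corner of $(\Lambda(z) - L)^{-1}$, with $\Lambda(z) = \mathrm{diag}(z, 0, 0)$, equals $(z - p(x, y))^{-1}$.

```python
# Sketch: Schur-complement identity behind the linearization trick for
# p(x, y) = xy + yx. The matrix L and the test matrices are illustrative
# assumptions, not data taken from the paper.

def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def mat_inv(a):
    """Gauss-Jordan inversion with partial pivoting (complex entries)."""
    n = len(a)
    aug = [[complex(v) for v in row] + [1 + 0j if i == j else 0j for j in range(n)]
           for i, row in enumerate(a)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[piv] = aug[piv], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [vr - f * vc for vr, vc in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def neg(m):
    return [[-v for v in row] for row in m]

def assemble(blocks):
    """Flatten a 3x3 grid of 2x2 blocks into one 6x6 matrix."""
    return [[blocks[bi][bj][i][j] for bj in range(3) for j in range(2)]
            for bi in range(3) for i in range(2)]

x = [[1.0, 2.0], [2.0, -1.0]]      # selfadjoint, does not commute with y
y = [[0.5, -1.0], [-1.0, 3.0]]     # selfadjoint
one = [[1.0, 0.0], [0.0, 1.0]]
zero = [[0.0, 0.0], [0.0, 0.0]]
z = 0.3 + 1.0j
z_one = [[z, 0.0], [0.0, z]]

# Lambda(z) - L for L = [[0, x, y], [x, 0, -1], [y, -1, 0]]
lin = assemble([[z_one, neg(x), neg(y)],
                [neg(x), zero, one],
                [neg(y), one, zero]])
corner = [row[:2] for row in mat_inv(lin)[:2]]

xy, yx = mat_mul(x, y), mat_mul(y, x)
resolvent = mat_inv([[z_one[i][j] - xy[i][j] - yx[i][j] for j in range(2)]
                     for i in range(2)])
max_err = max(abs(corner[i][j] - resolvent[i][j])
              for i in range(2) for j in range(2))
print(max_err)
```

The point of the design is that the resolvent of the non-linear expression $xy + yx$ is recovered from the resolvent of an expression that is linear in $x$ and $y$, at the cost of passing to matrix-valued coefficients.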

So the linearization trick gives that in order to understand the distribution of
$p(x_1, \dots, x_n)$ it suffices to understand the operator-valued distribution of $L$.
However, $L$ is now an operator-valued linear combination of $x_1, \dots, x_n$. Since basic
properties of freeness imply that the freeness of $x_1, \dots, x_n$ implies the
operator-valued freeness of $a_1 \otimes x_1, \dots, a_n \otimes x_n$ we get in our situation that the
distribution of $L$ is given by the operator-valued free convolution of its summands; the
operator-valued distribution of $a_i \otimes x_i$, however, is determined by the distribution
of $x_i$. So we have reduced our original non-linear problem to a linear one about
operator-valued convolution.

There exists already a theory by Voiculescu for dealing with such operator-valued
convolutions, relying on the so-called operator-valued $R$-transform. Up to now, this
theory has essentially been developed on the level of formal power series, see |l2"ll33| .
It has, however, to be remarked that we need a good analytic description of the
operator-valued convolution in order to arrive at a useful description of the solution
of our problem. Since there is rarely a situation where explicit solutions of those
equations are available (and that holds true for both the scalar-valued and the
operator-valued case), it is important that the equations are given in a form which
is analytically controllable and numerically implementable. In the scalar-valued
case it has become more and more apparent in the last years (see, e.g., [1]) that
using a subordination approach to free convolution (which, as also noted - and for the
first time explicitly formulated - in [14], is equivalent to the $R$-transform approach,
but has better analytic properties) is more suited for analytic questions. The
subordination property (see statements (1) and (3) of Theorem 2.2 below) has first
been proved in [41] by Voiculescu for scalar-valued free additive convolution, with
the purpose of investigating $L^p$-regularity of convolution measures in the context
of free entropy. In the operator-valued case there exist also already subordination
results, by Biane [10] and Voiculescu [43], [44], based on a power-series description of
the functions involved and algebraic properties of freeness with amalgamation. Our
main contribution in this article to the question of subordination will be to develop
a general analytic theory of subordination for general operator-valued convolutions
and to show that the fixed point equations arising from this subordination
formulation are indeed analytically controllable and numerically easily implementable.

We also want to point out that the previous works of Biane and Voiculescu were 
restricted to a tracial frame; in our general analytic approach, however, we do not need
to assume that our conditional expectation is given with respect to a tracial state, 
or for that matter that such a state at all exists on the given algebra. 

The content of the article is as follows. In Section 2, we develop the analytic
subordination description of operator-valued convolutions. Theorem 2.2 is the main
result about this. In Section 3, we will recall the linearization trick, in the form as
given by Anderson, and streamlined to our needs. In Section 4, we will combine the
linearization trick with our description of operator-valued convolution and show how
to solve Problem 1 (and thus also Problem 2) for general selfadjoint polynomials 
p. In Section 5, we will present some examples, by applying our algorithm to 
special polynomials and also compare this with numerical simulations of eigenvalue 
distributions for corresponding random matrices. 

2. The R-transform and subordination in the operator-valued context

Following Voiculescu [42], we will call an operator-valued non-commutative
probability space a triple $(\mathcal{M}, E, B)$, where $\mathcal{M}$ is a unital Banach algebra,
$B \subseteq \mathcal{M}$ is a Banach subalgebra containing the unit of $\mathcal{M}$, and
$E : \mathcal{M} \to B$ is a unit-preserving conditional expectation. In most cases,
$\mathcal{M}$ will be a von Neumann algebra, $B$ a $W^*$-subalgebra of $\mathcal{M}$ and thus $E$
will be completely positive weakly continuous. Elements in $\mathcal{M}$ will be called
operator-valued (or $B$-valued) random variables. The distribution of a random
variable $x \in \mathcal{M}$ with respect to $E$ is, by definition, the set of multilinear maps

$\mu_x := \{ m_n^x : B^{n-1} \to B : m_n^x(b_1, \dots, b_{n-1}) = E[x b_1 x b_2 \cdots x b_{n-1} x],\ n \in \mathbb{N} \}$.

We call $m_n^x$ the $n$th moment of $x$ (or, equivalently, of $\mu_x$). It will be convenient to
interpret $m_0^x$ as the constant equal to 1, the unit of $B$ (or, equivalently, of $\mathcal{M}$) and
$m_1^x = E[x]$, the expectation of $x$. We denote by $B\langle x_1, \dots, x_n \rangle$ the algebra generated
by $B$ and the elements $x_1, \dots, x_n$ in $\mathcal{M}$.

Definition 2.1. Two algebras $A_1, A_2 \subseteq \mathcal{M}$ containing $B$ are called free with
amalgamation over $B$ with respect to $E$ (or just free over $B$) if

$E[x_1 x_2 \cdots x_n] = 0$

whenever $n \in \mathbb{N}$, $x_j \in A_{i_j}$ satisfy $E[x_j] = 0$ for all $j$ and $i_j \neq i_{j+1}$, $1 \le j \le n - 1$.
Two random variables $x, y \in \mathcal{M}$ are called free over $B$ if $B\langle x \rangle$ and $B\langle y \rangle$ are free
over $B$.

If $x, y \in \mathcal{M}$ are free over $B$, then $\mu_{x+y}$ depends only on $\mu_x$ and $\mu_y$. Following
Voiculescu, we shall denote this dependency by $\mu_x \boxplus \mu_y$, and call it the free
additive convolution of the distributions $\mu_x$ and $\mu_y$. It is known [33], [42] that
$\boxplus$ is commutative and associative.

A very powerful tool for the study of operator-valued distributions is the
generalized Cauchy-Stieltjes transform and its fully matricial extension [351 S3]: for
a fixed $x \in \mathcal{M}$, we define $G_x(b) = E[(b - x)^{-1}]$ for all $b \in B$ for which $b - x$
is invertible in $\mathcal{M}$. One can easily verify that $G_x$ is a holomorphic mapping on
an open subset of $B$. Its fully matricial extension $G_x^{(n)}$ is defined on the set of
elements $b \in M_n(B)$ for which $b - x \otimes 1_n$ is invertible in $M_n(\mathcal{M})$, by the relation
$G_x^{(n)}(b) = (E \otimes \mathrm{Id}_{M_n(\mathbb{C})})[(b - x \otimes 1_n)^{-1}]$. It is a crucial observation of Voiculescu
that the family $\{G_x^{(n)}\}_{n \ge 1}$ encodes the distribution $\mu_x$ of $x$. A succinct description
of how to identify the $n$th moment of $x$ when $\{G_x^{(n)}\}_{n \ge 1}$ is known is given in [5].

Up to this point, all the notions we introduced required no involutive structure
and no positivity on either $\mathcal{M}$ or $B$. In particular, the definition of $G_x$ only requires
the existence of a multiplicative structure on $\mathcal{M}$ so that $\|xy\| \le M \|x\| \|y\|$ for some
$M > 0$ independent of $x$ and $y$ (although a certain - bounded - dependence on $\|x\|$
and $\|y\|$ could obviously be allowed). The reader can verify for herself that this
holds also for most of the analytic transforms which we introduce below. From now
on, we will assume that $\mathcal{M}$ is in addition a $C^*$-algebra, $B$ is a $C^*$-subalgebra of $\mathcal{M}$,
and $E$ is completely positive.

In the following we will use the notation $x > 0$ for the situation where $x \ge 0$
and $x$ is invertible; note that this is equivalent to the fact that there exists a real
$\varepsilon > 0$ such that $x \ge \varepsilon 1$. From the latter it is clear that $x > 0$ implies $E[x] > 0$
(because our conditional expectations are automatically completely positive). Any
element $x \in \mathcal{M}$ can be uniquely written as $x = \Re x + i \Im x$, where $\Re x = \frac{x + x^*}{2}$ and
$\Im x = \frac{x - x^*}{2i}$ are selfadjoint. We call $\Re x$ and $\Im x$ the real and imaginary part of $x$.

From now on we shall restrict our attention to the case when $x, y$ are selfadjoint.
In this case, one of the appropriate domains for $G_x$ - and the domain we will use most
- is the operator upper half-plane $\mathbb{H}^+(B) := \{b \in B : \Im b > 0\}$. Elements in this
open set are all invertible, and $\mathbb{H}^+(B)$ is invariant under conjugation by invertible
elements in $B$, i.e. if $b \in \mathbb{H}^+(B)$ and $c \in GL(B)$ is invertible, then $c b c^* \in \mathbb{H}^+(B)$.
It has been noted in [43] that $G_x^{(n)}$ maps $\mathbb{H}^+(M_n(B))$ into the operator lower
half-plane $\mathbb{H}^-(M_n(B)) := -\mathbb{H}^+(M_n(B))$ and has "good behaviour at infinity" in the
sense that

$\lim_{\|b^{-1}\| \to 0} b\, G_x^{(n)}(b) = \lim_{\|b^{-1}\| \to 0} G_x^{(n)}(b)\, b = 1$.

As, from the analytic perspective, the $G_x^{(n)}$ have essentially the same behaviour on
$\mathbb{H}^+(M_n(B))$ for any $n \in \mathbb{N}$, we shall restrict our analysis from now on to $G_x = G_x^{(1)}$.
However, all properties we deduce for this $G_x$, and all the related functions we shall
introduce, remain true, under the appropriate formulation, for all $n \ge 1$.

We shall use the following analytic mappings, all defined on $\mathbb{H}^+(B)$; all
transforms have a natural Schwarz-type analytic extension to the lower half-plane given
by $f(b^*) = f(b)^*$; in all formulas below, $x = x^*$ is fixed in $\mathcal{M}$:

• The moment generating function:

(1) $\Psi_x(b) = E[(1 - bx)^{-1} - 1] = E[(b^{-1} - x)^{-1}] b^{-1} - 1 = G_x(b^{-1}) b^{-1} - 1$;

• The reciprocal Cauchy transform:

(2) $F_x(b) = E[(b - x)^{-1}]^{-1} = G_x(b)^{-1}$;

• In this paper we shall call the following function "the h transform":

(3) $h_x(b) = E[(b - x)^{-1}]^{-1} - b = F_x(b) - b$.

It has been shown in [S] that

(4) $\Im F_x(b) \ge \Im b$, $b \in \mathbb{H}^+(B)$,

and thus

(5) $h_x(\mathbb{H}^+(B)) \subseteq \overline{\mathbb{H}^+(B)}$.
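
For orientation, the transforms (1)-(3) and the inequality (4) can be checked numerically in the simplest scalar situation $B = \mathbb{C}$, where $E$ is integration against a probability measure. The following is a minimal sketch under that assumption; the three-atom distribution and the sample points are arbitrary choices made for illustration and do not come from the paper.

```python
# Scalar sketch (B = C): x has a discrete distribution with the given atoms
# and weights, so E[f(x)] is a finite weighted sum. Illustrative values only.

def cauchy(z, atoms, weights):
    """G_x(z) = E[(z - x)^{-1}]."""
    return sum(p / (z - a) for a, p in zip(atoms, weights))

def recip_cauchy(z, atoms, weights):
    """F_x(z) = G_x(z)^{-1}, as in (2)."""
    return 1.0 / cauchy(z, atoms, weights)

def h_transform(z, atoms, weights):
    """h_x(z) = F_x(z) - z, as in (3)."""
    return recip_cauchy(z, atoms, weights) - z

def psi(b, atoms, weights):
    """Psi_x(b) = G_x(1/b) / b - 1, as in (1)."""
    return cauchy(1.0 / b, atoms, weights) / b - 1.0

atoms, weights = [-1.0, 0.5, 2.0], [0.2, 0.5, 0.3]

# (4)/(5): the imaginary part of h_x stays nonnegative on the upper half-plane.
grid = [complex(re, im) for re in (-3.0, -0.5, 0.0, 1.0, 4.0)
        for im in (0.01, 0.5, 2.0)]
min_im_h = min(h_transform(b, atoms, weights).imag for b in grid)

# (1): Psi_x(b) agrees with the moment series sum_{n>=1} m_n b^n for small b.
b_small = 0.1 + 0.05j
moments = [sum(p * a ** n for a, p in zip(atoms, weights)) for n in range(1, 80)]
series = sum(m * b_small ** n for n, m in enumerate(moments, start=1))
psi_err = abs(psi(b_small, atoms, weights) - series)

print(min_im_h, psi_err)
```

In the operator-valued setting the same formulas apply with inverses taken in $\mathcal{M}$ and $B$, and the order of the factors then matters.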

Based on the moment generating function, Voiculescu [H] introduced the
operator-valued $R$-transform. It has the fundamental property that it linearizes free additive
convolution of operator-valued distributions: if $x$ and $y$ are free over $B$, then

(6) $R_x(b) + R_y(b) = R_{x+y}(b)$, for $b$ in some neighbourhood of 0.
Equivalently, in terms of distributions, the above equation is written as

$R_{\mu_x} + R_{\mu_y} = R_{\mu_x \boxplus \mu_y}$.

We will use here the (equivalent) definition for the $R$-transform provided by Dykema
in [IT]. Define

(7) $C_x(b) = E[(1 - bx)^{-1}] b = \sum_{n=0}^{\infty} E[(bx)^n b]$, $\|b\| < \|x\|^{-1}$.

Then [TTJ Proposition 4.1], since $C_x'(0) = \mathrm{Id}_B$, by the inverse function theorem there
exists a unique $B$-valued analytic map $R_x$ defined on a neighbourhood of zero so
that

(8) $C_x^{\langle -1 \rangle}(b) = (1 + b R_x(b))^{-1} b = b (1 + R_x(b) b)^{-1}$, $\|b\|$ small enough.

($C_x^{\langle -1 \rangle}$ denotes the compositional inverse of $C_x$ on some small enough neighbourhood
of zero.) Then [TTl Theorem 4.7] equation (6) takes place.

For our purposes, it is important to note that $C_x(b^{-1}) = G_x(b)$, $b \in \mathbb{H}^+(B)$. Then
we have formally
$G_x^{\langle -1 \rangle}(b) = (C_x^{\langle -1 \rangle}(b))^{-1} = b^{-1}(1 + b R_x(b)) = (1 + R_x(b) b) b^{-1}$,
so that

(9) $R_x(b) = G_x^{\langle -1 \rangle}(b) - b^{-1}$,

as in the scalar case [38]. To make this rigorous: there exists an $r = r_x > 0$ so that
both $C_x$ and $C_x^{\langle -1 \rangle}$ are defined on $B(0, r) = \{b \in B : \|b\| < r\}$ and map $B(0, r/2)$
strictly inside $B(0, r)$. The set $\{b \in \mathbb{H}^+(B) : \|b^{-1}\| < r/2\}$ is open and nonempty,
since it contains bo = — 1. G x (w) = C x {w^ 1 ) will then necessarily map this set into 
5(0, r). Observe that T 

G x (b ) = E[(jl - x)- 1 } = --E[(^l + x 2 )- 1 ] - E[(^l + x 2 )-^} g MT(B) 

is trivially invertible and ||Ga;(&o)|| < j + r2 ^" < § whenever r < ||x||/4. By the 
continuity of the inverse function and the openness of the set of invertible elements,
we have shown that there exists an open set $V_x$ included in $\mathbb{H}^-(B) \cap B(0, r)$ on
which $G_x^{\langle -1 \rangle}$ is defined and $G_x^{\langle -1 \rangle}(b) = (C_x^{\langle -1 \rangle}(b))^{-1}$. Thus, on an open set included
in $B(0, r)$, equation (9) takes place. Since $R_x$ is uniquely determined on $B(0, r)$,
the identity principle for analytic maps on Banach spaces guarantees that (9) is
enough to determine $R_x$, and knowledge of one of $R$ or $G$ (on the corresponding
domains) implies knowledge of the other.
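
As a sanity check of relation (9) in the scalar case $B = \mathbb{C}$, one can take a semicircular variable of variance $\sigma^2$, whose $R$-transform is known to be $R(w) = \sigma^2 w$. The sketch below is an illustration under that assumption; the helper names and sample points are hypothetical choices. It evaluates $G^{\langle -1 \rangle}(w) - w^{-1}$ at $w = G(z)$, where $G^{\langle -1 \rangle}(G(z)) = z$, and compares the result with $\sigma^2 w$.

```python
import cmath
import math

def g_semicircle(z, var):
    """Cauchy transform of the semicircle law with variance var, for Im z > 0.

    Writing the square root as a product of two principal square roots picks
    the branch that behaves like 1/z at infinity.
    """
    s = math.sqrt(var)
    return (z - cmath.sqrt(z - 2 * s) * cmath.sqrt(z + 2 * s)) / (2 * var)

def r_via_inverse(z, var):
    """Return (w, R(w)) with w = G(z), using R(w) = G^{<-1>}(w) - 1/w as in (9)."""
    w = g_semicircle(z, var)
    return w, z - 1.0 / w

points = [0.2 + 0.5j, -1.5 + 0.1j, 3.0 + 2.0j]
r_err = 0.0
for var in (1.0, 2.0):
    for z in points:
        w, r_w = r_via_inverse(z, var)
        r_err = max(r_err, abs(r_w - var * w))

print(r_err)
```

Since $R(w) = \sigma^2 w$ is linear in the variance, the additivity (6) says here that the sum of two free semicircular variables of variance 1 is semicircular of variance 2.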

In [43], under certain assumptions on $(\mathcal{M}, E, B)$, Voiculescu proved the existence
of two fully matricial self-maps $\omega_1, \omega_2$ of $\mathbb{H}^+(B)$ having good asymptotics "at
infinity" which satisfy the relations $G_x \circ \omega_1 = G_y \circ \omega_2 = G_{x+y}$ on $\mathbb{H}^+(B)$. Thus,
locally, $\omega_1 = G_x^{\langle -1 \rangle} \circ G_{x+y}$ and $\omega_2 = G_y^{\langle -1 \rangle} \circ G_{x+y}$. Provided one can show that the
functions involved are well-defined on the set $\{b \in \mathbb{H}^+(B) : \|b^{-1}\| < r/2\}$ for some
$r > 0$, equations (6) and (9) provide us with the relation

$\omega_1(b) + \omega_2(b) = G_x^{\langle -1 \rangle}(G_{x+y}(b)) + G_y^{\langle -1 \rangle}(G_{x+y}(b))$

$= R_x(G_{x+y}(b)) + R_y(G_{x+y}(b)) + 2 G_{x+y}(b)^{-1}$

$= (R_{x+y}(G_{x+y}(b)) + G_{x+y}(b)^{-1}) + G_{x+y}(b)^{-1}$

(10) $= b + F_{x+y}(b)$,

for all $b \in \{b \in \mathbb{H}^+(B) : \|b^{-1}\| < r/2\}$. We have used (9) in the second and last
equalities and (6) in the third.
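
Relation (10) and the subordination relations can be verified by hand in a scalar toy case. The sketch below assumes $B = \mathbb{C}$ and takes $x$, $y$ free semicircular with variances `var_x`, `var_y` (illustrative values), so that $R_x(w) = \mathrm{var}_x\, w$, $R_y(w) = \mathrm{var}_y\, w$ and $x + y$ is semicircular with variance `var_x + var_y`. In that case (9) gives the closed forms $\omega_1(b) = b - \mathrm{var}_y\, G_{x+y}(b)$ and $\omega_2(b) = b - \mathrm{var}_x\, G_{x+y}(b)$, which are specific to this example.

```python
import cmath
import math

def g_semicircle(z, var):
    """Cauchy transform of the semicircle law with variance var, for Im z > 0."""
    s = math.sqrt(var)
    return (z - cmath.sqrt(z - 2 * s) * cmath.sqrt(z + 2 * s)) / (2 * var)

var_x, var_y = 1.0, 3.0          # illustrative variances
b = 0.5 + 0.7j                   # a point in the upper half-plane

g_sum = g_semicircle(b, var_x + var_y)   # G_{x+y}(b)
omega1 = b - var_y * g_sum               # G_x^{<-1>}(G_{x+y}(b)) for this example
omega2 = b - var_x * g_sum               # G_y^{<-1>}(G_{x+y}(b)) for this example

# Subordination: G_x(omega1(b)) = G_y(omega2(b)) = G_{x+y}(b).
subordination_err = max(abs(g_semicircle(omega1, var_x) - g_sum),
                        abs(g_semicircle(omega2, var_y) - g_sum))

# Relation (10): omega1(b) + omega2(b) = b + F_{x+y}(b).
sum_rule_err = abs(omega1 + omega2 - (b + 1.0 / g_sum))

print(subordination_err, sum_rule_err)
```

Both imaginary parts of `omega1` and `omega2` exceed that of `b` here, because $G_{x+y}(b)$ lies in the lower half-plane.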

Equation (10) appeared first in j5] Lemma 7.2] in the scalar case. The
non-triviality of our argument will consist in "following the domains." Regrettably,
this non-triviality haunts us: we are not able to conclude directly from the above
that $\omega_j(b) \in \mathbb{H}^+(B)$, and thus, a priori, we are not able to write
$G_x(\omega_1(b)) = G_y(\omega_2(b)) = G_{x+y}(b)$ for all $b \in \{b \in \mathbb{H}^+(B) : \|b^{-1}\| < r/2\}$. To avoid
this problem (and thus give a rigorous definition of $\omega_1$ and $\omega_2$) we shall switch back
to $C$: the function $\omega_1$ can be re-written as

(11) $(\omega_1(b^{-1}))^{-1} = (G_x^{\langle -1 \rangle}(G_{x+y}(b^{-1})))^{-1} = C_x^{\langle -1 \rangle}(C_{x+y}(b))$, $\|b\| < r/2$.

Thus, $b \mapsto (\omega_1(b^{-1}))^{-1}$ has an analytic extension to a neighbourhood of zero which
fixes zero. We shall denote this extension by $o_1$ and write

(12) $C_x(o_1(b)) = C_{x+y}(b)$, $\|b\|$ small enough.

Similarly $C_y(o_2(b)) = C_{x+y}(b)$, $\|b\| < r/2$.

Theorem 2.2. Assume that $(\mathcal{M}, E, B)$ is a $C^*$-operator-valued non-commutative
probability space and $x, y \in \mathcal{M}$ are two selfadjoint operator-valued random variables
free over $B$. Then there exists a unique pair of Fréchet (and thus also Gateaux)
analytic maps $\omega_1, \omega_2 : \mathbb{H}^+(B) \to \mathbb{H}^+(B)$ so that

(1) $\Im \omega_j(b) \ge \Im b$ for all $b \in \mathbb{H}^+(B)$, $j \in \{1, 2\}$;

(2) $F_x(\omega_1(b)) + b = F_y(\omega_2(b)) + b = \omega_1(b) + \omega_2(b)$ for all $b \in \mathbb{H}^+(B)$;

(3) $G_x(\omega_1(b)) = G_y(\omega_2(b)) = G_{x+y}(b)$ for all $b \in \mathbb{H}^+(B)$.

Moreover, if $b \in \mathbb{H}^+(B)$, then $\omega_1(b)$ is the unique fixed point of the map

$f_b : \mathbb{H}^+(B) \to \mathbb{H}^+(B)$, $f_b(w) = h_y(h_x(w) + b) + b$,

and $\omega_1(b) = \lim_{n \to \infty} f_b^{\circ n}(w)$ for any $w \in \mathbb{H}^+(B)$. Same statements hold for $\omega_2$,
with $f_b$ replaced by $w \mapsto h_x(h_y(w) + b) + b$.
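
The fixed point iteration in the theorem is easy to try out in the scalar case $B = \mathbb{C}$. The following minimal sketch is an illustration, not the algorithm of the later sections: it assumes that $x$ and $y$ are free and both have the symmetric Bernoulli distribution (atoms $\pm 1$ with weight $1/2$ each), for which the free additive convolution is the arcsine law on $[-2, 2]$ with Cauchy transform $1/\sqrt{z^2 - 4}$. The function names, the starting point $w = b$ and the iteration count are arbitrary choices.

```python
import cmath

def g_bernoulli(z):
    """Cauchy transform of the symmetric Bernoulli law on {-1, +1}."""
    return z / (z * z - 1.0)

def h(w):
    """h transform: h(w) = 1 / G(w) - w (the same for x and y in this example)."""
    return 1.0 / g_bernoulli(w) - w

def omega1(b, n_iter=200):
    """Iterate f_b(w) = h_y(h_x(w) + b) + b, starting from w = b."""
    w = b
    for _ in range(n_iter):
        w = h(h(w) + b) + b
    return w

def g_arcsine(z):
    """Cauchy transform of the arcsine law on [-2, 2], for Im z > 0."""
    return 1.0 / (cmath.sqrt(z - 2.0) * cmath.sqrt(z + 2.0))

b = 1.0 + 1.0j
w = omega1(b)

# Item (3): G_x(omega_1(b)) = G_{x+y}(b).
sub_err = abs(g_bernoulli(w) - g_arcsine(b))
# Item (2), with omega_1 = omega_2 = w because x and y have the same law:
sum_err = abs(2.0 * w - (b + 1.0 / g_arcsine(b)))

print(w, sub_err, sum_err)
```

Only evaluations of $h_x$ and $h_y$ on the upper half-plane are needed, which is what makes the subordination formulation convenient for numerical work.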

Before starting the proof, let us give a brief outline of the reasoning and the main
difficulties we will encounter here compared to the scalar case proof from [4]. There
are two separate parts of this proof: one shows that the iterations described in the
Theorem converge to analytic functions $\omega_1, \omega_2$ as claimed, and that the equalities
$F_x(\omega_1(b)) = F_y(\omega_2(b)) = \omega_1(b) + \omega_2(b) - b$ take place. The other part shows that in
fact $F_x(\omega_1(b)) = F_{x+y}(b)$. Paradoxically, in the operator-valued case, this second
part is considerably more difficult and we will have to dedicate to it several pages.
The main idea will be to make a change of variable $o_1(b) = (\omega_1(b^{-1}))^{-1}$ in order
to be able to work on a neighbourhood of zero without worrying about domains of
inverses, use the $R$-transform to connect $o_1$ to $C_{x+y}$, and finally prove that there
exists a nonempty open set in $\mathbb{H}^+(B)$ which is mapped by $o_1$ in $\mathbb{H}^+(B)$. This will
allow us to come back to the functions $\omega$ and conclude that on this open subset
$F_x \circ \omega_1 = F_{x+y}$. We next extend $\omega_1$ to all of $\mathbb{H}^+(B)$ as a limit of iterations of $f_b$,
and use the uniqueness of analytic continuation to conclude.

Proof. We shall begin by showing that functions $\omega_1, \omega_2$ satisfying item (3) in our
theorem exist and are well defined on some open set in $\mathbb{H}^+(B)$. In order to do that,
we shall use the change of variables advertised above. Recall from (11) and (12)
the definition $o_1(b) = C_x^{\langle -1 \rangle}(C_{x+y}(b))$ and $o_2(b) = C_y^{\langle -1 \rangle}(C_{x+y}(b))$, for $b$ in some
neighbourhood of zero. On this neighbourhood, we can perform inversions of all
functions $C$, and so the definition (8) of the $R$-transform in terms of the function $C$
together with (6) allow us to write

(13) $(b^{-1} C_x^{\langle -1 \rangle}(b))^{-1} + (b^{-1} C_y^{\langle -1 \rangle}(b))^{-1} = 1 + (b^{-1} C_{x+y}^{\langle -1 \rangle}(b))^{-1}$,

for any invertible $b$ of small norm (it is trivial that $b \mapsto b^{-1} C^{\langle -1 \rangle}(b)$ has an analytic
extension to a neighbourhood of zero which takes values close to one, hence is
invertible in $B$).

Let us note that, from the definition and power series expansion (7) of $C$, it
follows that both $b^{-1} C_x(b)$ and $C_x(b) b^{-1}$ have well-defined analytic extensions to
$B(0, \|x\|^{-1})$, regardless of the actual invertibility of $b$. For the compositional inverse
$C_x^{\langle -1 \rangle}$ of $C_x$ the same holds true, as noted in equation (8) (the reader can verify this
also directly), on some (possibly smaller) neighbourhood of the origin. Thus, we
will take the liberty to denote the extension of $b \mapsto b^{-1} C_x(b)$ to $B(0, \|x\|^{-1})$ with the
same notation $b^{-1} C_x(b)$, and similarly for $C_x^{\langle -1 \rangle}$: in the following, writing $b^{-1} C_x(b)$
will imply no assumption on the invertibility of $b$.
By substituting $C_{x+y}(b)$ for $b$ in (13), we obtain

(14) $(C_{x+y}(b)^{-1} C_x^{\langle -1 \rangle}(C_{x+y}(b)))^{-1} + (C_{x+y}(b)^{-1} C_y^{\langle -1 \rangle}(C_{x+y}(b)))^{-1} = 1 + b^{-1} C_{x+y}(b)$,

for any $b$ sufficiently small in $B$. Thus,

$C_{x+y}(b)^{-1} C_y^{\langle -1 \rangle}(C_{x+y}(b)) = [1 + b^{-1} C_{x+y}(b) - (C_{x+y}(b)^{-1} C_x^{\langle -1 \rangle}(C_{x+y}(b)))^{-1}]^{-1}$.

The expression under the square brackets is close to 1, hence invertible. We
recognize easily the functions $o$:

$C_{x+y}(b)^{-1} o_2(b) = [1 + b^{-1} C_{x+y}(b) - (C_{x+y}(b)^{-1} o_1(b))^{-1}]^{-1}$.

Here an implicit statement is made regarding the analytic continuation to a
neighbourhood of zero of $C_{x+y}(b)^{-1} o_2(b)$, $C_{x+y}(b)^{-1} o_1(b)$ and $(C_{x+y}(b)^{-1} o_1(b))^{-1}$: these
statements follow from the analytic continuation of $o_1 = C_x^{\langle -1 \rangle} \circ C_{x+y}$ with $o_1(0) = 0$,
the analytic continuation of $w \mapsto C_x(w) w^{-1}$, and the proximity to 1 of $C_x(w) w^{-1}$
when $\|w\|$ is small, on such a neighbourhood.

The power series expansion of $C_x^{\langle -1 \rangle}$ indicates (see (8)) that for $\|b\|$ sufficiently small,
if $C_{x+y}(b)$ is invertible in $B$, then so is $o_1(b)$. We conclude by analytic continuation
that

(15) $o_2(b) = C_x(o_1(b)) [1 + b^{-1} C_x(o_1(b)) - (C_x(o_1(b))^{-1} o_1(b))^{-1}]^{-1}$

(16) $= [(1 + b^{-1} C_x(o_1(b)) - (C_x(o_1(b))^{-1} o_1(b))^{-1}) C_x(o_1(b))^{-1}]^{-1}$.

The first equality has the obvious meaning for all $b \in B(0, r)$, for some small enough
$r > 0$, while (16) will be considered on some fixed open subset of $B(0, r)$ on which
$C_{x+y}(b)$ is invertible (for example, on some small neighbourhood of $\frac{i}{M} 1$ with $M > 1$
large enough). In that case, not only $C_{x+y}(b) = C_x(o_1(b)) = C_y(o_2(b))$ is invertible
in $B$, but so is $C_x(o_1(b))^{-1} o_1(b)$ - being close to one by equations (7) and (11) -
and the whole parenthesis $(1 + b^{-1} C_x(o_1(b)) - (C_x(o_1(b))^{-1} o_1(b))^{-1})$, again being
close to 1 by the same argument.

Our next objective is to show that if $\Im b > 0$ (for example, if $b$ belongs to
a neighbourhood of $\frac{i}{M} 1$ as described above), then $\Im o_1(b), \Im o_2(b) > 0$. For this
purpose it will be convenient to express $o_1$ as a fixed point in a formula involving the
definition (7) of $C$ as an expectation of a resolvent: by (16), $C_x(o_1(b)) = C_y(o_2(b))$
is equivalent to

$C_x(o_1(b)) = C_y([(1 + b^{-1} C_x(o_1(b)) - (C_x(o_1(b))^{-1} o_1(b))^{-1}) C_x(o_1(b))^{-1}]^{-1})$,

again under the same hypotheses of invertibility of $C_{x+y}(b)$. Rewrite this by using
the definitions of $C_x$ and $C_y$ (see (7)):

$E[(1 - o_1(b) x)^{-1}] o_1(b)$

$= E[(1 - [(1 + b^{-1} C_x(o_1(b)) - (C_x(o_1(b))^{-1} o_1(b))^{-1}) C_x(o_1(b))^{-1}]^{-1} y)^{-1}] [(1 + b^{-1} C_x(o_1(b)) - (C_x(o_1(b))^{-1} o_1(b))^{-1}) C_x(o_1(b))^{-1}]^{-1}$

$= E[((1 + b^{-1} C_x(o_1(b)) - (C_x(o_1(b))^{-1} o_1(b))^{-1}) C_x(o_1(b))^{-1} - y)^{-1}]$.

We assume $b \in B(0, r)$ has positive imaginary part, and hence is invertible. Since
$C$ is of the form $b(1 + v(b))$, with $\lim_{b \to 0} v(b) = 0$, for $r$ sufficiently small, the
invertibility of $b$ will imply the invertibility of all the three functions $C$ involved,
while the invertibility of $b \mapsto C_x(o_1(b))^{-1} o_1(b)$ is guaranteed by having chosen $r > 0$
sufficiently small. So we can rewrite the above as

(17) $E[(1 - o_1(b) x)^{-1}] o_1(b) = E[((1 - (C_x(o_1(b))^{-1} o_1(b))^{-1}) C_x(o_1(b))^{-1} + b^{-1} - y)^{-1}]$.
By definition, $C_x(w) = w E[(1 - xw)^{-1}]$. For $\|w\|$ sufficiently small, $E[(1 - xw)^{-1}]$
is invertible, so the invertibility of $C_x(w)$ in $B$ will imply the invertibility of $w$
whenever $\|w\|$ is small enough; indeed, the inverse is trivially

$w^{-1} = E[(1 - xw)^{-1}] (w E[(1 - xw)^{-1}])^{-1}$.

Now since $C_x(w)^{-1} w = (w E[(1 - xw)^{-1}])^{-1} w = E[(1 - xw)^{-1}]^{-1}$,

$(1 - (C_x(w)^{-1} w)^{-1}) C_x(w)^{-1} = (1 - E[(1 - xw)^{-1}]) (w E[(1 - xw)^{-1}])^{-1}$

$= -E[(1 - xw)^{-1} x] w (E[(1 - wx)^{-1}] w)^{-1}$

$= E[-x (1 - wx)^{-1}] E[(1 - wx)^{-1}]^{-1}$

$= E[x (wx - 1)^{-1}] E[(1 - wx)^{-1}]^{-1}$,

which has an analytic extension around zero. As for $\|b\|$ small enough, $\|o_1(b)\|$ is
small, invertibility of $C_{x+y}(b) = C_x(o_1(b)) = C_y(o_2(b))$ in $B$ will imply the
invertibility of $o_1(b), o_2(b)$. Under the assumption that $\Im b > 0$ and $\|b\|$ is small enough,
so that invertibility of $C_{x+y}(b)$ implies the invertibility of $o_1(b), o_2(b)$, we can rewrite
the above equality (17) as

$E[(1 - o_1(b) x)^{-1}] o_1(b) = E[(E[x (o_1(b) x - 1)^{-1}] E[(1 - o_1(b) x)^{-1}]^{-1} + b^{-1} - y)^{-1}]$.

Dividing by the (invertible) element $E[(1 - o_1(b) x)^{-1}] \in B$ gives

(18) $o_1(b) = E[(E[x (o_1(b) x - 1)^{-1}] + (b^{-1} - y) E[(1 - o_1(b) x)^{-1}])^{-1}]$.

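We will argue now that for elements $b \in \mathbb{H}^+(B)$ with the property that $-\Im(b^{-1})$
is very large, while $\Re(b^{-1})$ stays bounded, $o_1(b) \in \mathbb{H}^+(B)$. This will follow very
easily from equation (18) as soon as we prove the following two

Claims:

(1) For a given $k \in (0, +\infty)$ and $v \in \mathbb{H}^+(\mathcal{M})$ with $\Im v > k 1$, there exists
$\varepsilon = \frac{k}{2 \|v\|} > 0$ so that $\|c - 1\| < \varepsilon \implies \Im(vc) > \frac{k}{2} 1$.

(2) For any $\varepsilon > 0$ and $x = x^* \in \mathcal{M}$ there exists a $\delta > 0$ so that for any $w \in \mathcal{M}$,
$\|w\| < \delta \implies -\varepsilon 1 < \Im E[x (1 - wx)^{-1}] < \varepsilon 1$.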

We prove claim (1) first. We have

$\Im(vc) = \frac{1}{2i}(vc - c^* v^*) = \frac{1}{2i}(v(c - 1) + v - (c^* - 1) v^* - v^*) = \frac{1}{2i}(v(c - 1) - (c - 1)^* v^*) + \Im v$.

Since for any selfadjoint $a$ we have $-\|a\| 1 \le a \le \|a\| 1$, we get

$-\frac{1}{2} \|v(c - 1) - (c - 1)^* v^*\| 1 \le \frac{1}{2i}(v(c - 1) - (c - 1)^* v^*) \le \frac{1}{2} \|v(c - 1) - (c - 1)^* v^*\| 1$.

It follows that $-\|v\| \|c - 1\| 1 \le \frac{1}{2i}(v(c - 1) - (c - 1)^* v^*) \le \|v\| \|c - 1\| 1$, i.e.

$-\varepsilon \|v\| 1 < \frac{1}{2i}(v(c - 1) - (c - 1)^* v^*) < \varepsilon \|v\| 1$,

and so

$\Im v - \varepsilon \|v\| 1 < \frac{1}{2i}(v(c - 1) - (c - 1)^* v^*) + \Im v = \Im(vc) < \varepsilon \|v\| 1 + \Im v$.

Thus picking the positive $\varepsilon = \frac{k}{2 \|v\|}$ will guarantee that $\Im v - \frac{k}{2} 1 < \Im(vc) < \Im v + \frac{k}{2} 1$.
This proves the first claim.

Claim (2) is quite trivial: observe that
$\Im(x (1 - wx)^{-1}) = (x w^* - 1)^{-1} x (\Im w) x (wx - 1)^{-1}$,
so making $\|w\|$ sufficiently small will provide the desired result.
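
Both claims are elementary, and in the commutative toy case $B = \mathcal{M} = \mathbb{C}$ they can be probed numerically. The sketch below is only a plausibility check under that assumption, with randomly drawn sample values (the ranges and the seed are arbitrary): it tests the implication of Claim (1) with $\varepsilon = k / (2|v|)$ and the identity for $\Im(x(1 - wx)^{-1})$ used in Claim (2).

```python
import cmath
import math
import random

random.seed(0)

# Claim (1), scalar version: Im v > k and |c - 1| < k / (2 |v|)
# should force Im(v c) > k / 2.
worst_margin = float("inf")
for _ in range(10000):
    k = random.uniform(0.1, 5.0)
    v = complex(random.uniform(-10.0, 10.0), k + random.uniform(1e-3, 10.0))
    eps = k / (2.0 * abs(v))
    radius = eps * random.random()            # lies in [0, eps)
    c = 1.0 + cmath.rect(radius, random.uniform(0.0, 2.0 * math.pi))
    worst_margin = min(worst_margin, (v * c).imag - k / 2.0)

# Claim (2), scalar version of the identity
# Im(x (1 - w x)^{-1}) = (x w* - 1)^{-1} x (Im w) x (w x - 1)^{-1}.
x = 1.7
w = 0.2 - 0.1j
lhs = (x / (1.0 - w * x)).imag
rhs = (x * w.imag * x / ((x * w.conjugate() - 1.0) * (w * x - 1.0))).real
identity_err = abs(lhs - rhs)

print(worst_margin, identity_err)
```

A positive `worst_margin` means that no sampled pair violated Claim (1); it is of course not a proof.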
We apply the first claim to $v = y - b^{-1}$ and $c = E[(1 - o_1(b) x)^{-1}]$: this means we
require that $\|c - 1\| < \frac{M}{2 \|b^{-1} - y\|}$, while at the same time $\Im(y - b^{-1}) > M 1 > 1$.
For this to happen, it is enough that

$\frac{\|o_1(b) x\|}{1 - \|o_1(b) x\|} < \frac{M}{2 \|b^{-1} - y\|}$,

or, equivalently, $\|o_1(b) x\| < \frac{M}{M + 2 \|b^{-1} - y\|}$. From the similar property of $C$, we know
that for any $\varepsilon > 0$ there exists a $\delta = \delta_{\varepsilon, x, y} > 0$ so that
$\|b\| < \delta \implies \|o_1(b)\| < \|b\| (1 + \varepsilon)$.
Now, for $\|o_1(b) x\| < \frac{M}{M + 2 \|b^{-1} - y\|}$ to happen, it is enough that

$\|o_1(b)\| \|x\| < \frac{M}{M + 2 \|\Re(b^{-1})\| + 2 \|\Im(b^{-1})\| + 2 \|y\|}$.

We will restrict $\|\Re(b^{-1})\| \in (0, 1)$ and $\|\Im(b^{-1})\| \in (M, 2M)$ (these two inequalities
together with the requirement $-\Im(b^{-1}) > M 1$ determine an open nonempty set). This
will make our condition to be implied by $\|o_1(b)\| \|x\| < \frac{M}{5M + 2 + 2 \|y\|}$. The
conditions on $\Re(b^{-1}), \Im(b^{-1})$ imply that $\|b\| < \frac{1}{M}$. As $M \to \infty$, this will send $\|b\|$ to
zero, and so for any chosen $\varepsilon > 0$, if $M$ is sufficiently large,

$\|o_1(b)\| < \|b\| (1 + \varepsilon) < \frac{M}{\|x\| (5M + 2 + 2 \|y\|)}$.

This shows that the first claim can be applied to $v = y - b^{-1}$
and $c = E[(1 - o_1(b) x)^{-1}]$ to conclude that for $b$ in an open set having zero in its
closure we have $\Im((y - b^{-1}) E[(1 - o_1(b) x)^{-1}]) > \frac{M}{2} 1$.

By making the $M$ above sufficiently large, we finally apply the second claim to
$w = o_1(b)$ and our given $x$ to obtain the inequalities
$-\frac{1}{2} 1 < \Im E[x (1 - o_1(b) x)^{-1}] < \frac{1}{2} 1$. We conclude that for sufficiently large
$M \in (0, +\infty)$ and $b$ so that $\|\Re(b^{-1})\| \in (0, 1)$, $\|\Im(b^{-1})\| \in (M, 2M)$ and
$-\Im(b^{-1}) > M 1$,

$\Im(E[x (1 - o_1(b) x)^{-1}] + (y - b^{-1}) E[(1 - o_1(b) x)^{-1}]) > 0$.

By the positivity of $E$, the fact that $w \mapsto w^{-1}$ maps $\mathbb{H}^+(B)$ bijectively onto $\mathbb{H}^-(B)$,
and equation (18), we conclude that for such $b \in \mathbb{H}^+(B)$ we have $\Im o_1(b) > 0$.
Relation (16) can then be re-written as

(19) $o_2(b) = [C_x(o_1(b))^{-1} - o_1(b)^{-1} + b^{-1}]^{-1}$.

Recalling that $C_x(o_1(b))^{-1} = F_x(o_1(b)^{-1})$ and $\Im F_x(w) \le \Im w$ when $w \in \mathbb{H}^-(B)$, it
follows that $\Im(F_x(o_1(b)^{-1}) - o_1(b)^{-1} + b^{-1}) < 0$, so $\Im o_2(b) > 0$.

Since $o_j(b) = (\omega_j(b^{-1}))^{-1}$, $j \in \{1, 2\}$, we found elements $b \in \mathbb{H}^+(B)$ so that
$\omega_j(b) \in \mathbb{H}^+(B)$, and so we can write $G_x(\omega_1(b)) = G_{x+y}(b) = G_y(\omega_2(b))$. This
proves the existence of the functions $\omega$ on a common set in the operator upper
half-plane of $B$.

We go now to the second part of the proof, namely that $\lim_{n \to \infty} f_b^{\circ n}(w)$ exists for
all $b, w \in \mathbb{H}^+(B)$, depends only on $b$ (not on $w$), and is the attracting fixed point of
$f_b$. To do that, let us start by proving that for any $\varepsilon > 0$, $h_y(\mathbb{H}^+(B) + i \varepsilon 1)$ is norm
bounded, with bound depending only on $\varepsilon$ and $\|y\|$. For the reader's convenience,
we shall state and prove this fact in a separate lemma.

Lemma 2.3. If $(\mathcal{M}, E, B)$ is a $C^*$-operator-valued non-commutative probability
space, $y = y^* \in \mathcal{M}$ is a $B$-valued selfadjoint random variable and

$h_y : \mathbb{H}^+(B) \to \overline{\mathbb{H}^+(B)}$, $h_y(w) = E[(w - y)^{-1}]^{-1} - w$,

then for all $w \in \mathbb{H}^+(B) + i \varepsilon 1$, we have

$\|h_y(w)\| \le 4 \|y\| (1 + 2 \varepsilon^{-1} \|y\|)$.

Proof. We shall use the same trick as in [6], namely for a fixed $w = u + iv \in \mathbb{H}^+(B)$ so that $v > \varepsilon 1$ and $\varphi\colon B \to \mathbb{C}$ positive linear functional, we define the analytic functions

\[ f_{\varphi,w}\colon \mathbb{C}^+ \to \mathbb{C}^+ \cup \mathbb{R}, \quad f_{\varphi,w}(z) = \varphi(h_y(u + zv)). \]

A straightforward computation (already performed in [6, Lemma 2.3]) shows that

\[ \lim_{z\to\infty} h_y(u + zv) = -E[y], \quad \lim_{z\to\infty} z\left(h_y(u + zv) + E[y]\right) = E[y]v^{-1}E[y] - E[yv^{-1}y]. \]

The continuity of positive linear functionals on C*-algebras allows us to conclude that

\[ \lim_{z\to\infty} f_{\varphi,w}(z) = -\varphi(E[y]), \quad \lim_{z\to\infty} z\left(f_{\varphi,w}(z) + \varphi(E[y])\right) = \varphi\left(E[y]v^{-1}E[y] - E[yv^{-1}y]\right). \]

Thus, by the Nevanlinna representation [1, Chapter III], there exists a positive compactly supported measure $\rho$ on the real line with $\rho(\mathbb{R}) = \varphi\left(E[yv^{-1}y] - E[y]v^{-1}E[y]\right)$ so that

\[ f_{\varphi,w}(z) = -\varphi(E[y]) + \int_{\mathbb{R}} \frac{1}{t - z}\, d\rho(t). \]

This gives a bound for $f_{\varphi,w}(z)$ as follows:

\[ |f_{\varphi,w}(z)| \le |-\varphi(E[y])| + \frac{1}{\Im z}\rho(\mathbb{R}) \le \|\varphi\|\|y\| + \frac{1}{\Im z}\|\varphi\| \left\|E[yv^{-1}y] - E[y]v^{-1}E[y]\right\|. \]

Since $\varphi(h_y(w)) = f_{\varphi,w}(i)$, it follows that

\[ |\varphi(h_y(w))| \le \|\varphi\| \left( \|y\| + \left\|E[yv^{-1}y] - E[y]v^{-1}E[y]\right\| \right) \le \|\varphi\|\|y\| \left(1 + 2\varepsilon^{-1}\|y\|\right), \]

for any linear positive $\varphi$ and any $w \in \mathbb{H}^+(B)$ with $\Im w > \varepsilon 1$. Since, by the Jordan decomposition, any continuous linear functional on $B$ is a sum of four positive linear functionals, it follows that

\[ \sup_{\|\varphi\| \le 1} |\varphi(h_y(w))| \le 4\|y\| \left(1 + 2\varepsilon^{-1}\|y\|\right), \]

for all $w \in \mathbb{H}^+(B) + i\varepsilon 1$. Since $B$ is in particular a Banach space, we know that it embeds isometrically in its bidual, so $\|b\| = \sup_{\|\varphi\| \le 1} |\varphi(b)|$ for any $b \in B$, where $\varphi$ runs through the dual of $B$ and the norm is the norm on the dual of $B$. Thus,

\[ \|h_y(w)\| \le 4\|y\| \left(1 + 2\varepsilon^{-1}\|y\|\right), \quad \text{for } w \in \mathbb{H}^+(B),\ \Im w > \varepsilon 1, \]

which proves our lemma. □
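The bound of Lemma 2.3 can be checked numerically in a finite-dimensional toy model. The following is a minimal sketch under assumptions that are not part of the text: we take $B = M_N(\mathbb{C})$, $\mathcal{M} = M_N(\mathbb{C}) \otimes M_k(\mathbb{C})$ and $E = \mathrm{id} \otimes \mathrm{tr}_k$ (the partial normalized trace), and all helper names (`expectation`, `h`, `random_hermitian`) are hypothetical.

```python
import numpy as np

def expectation(a, N, k):
    """E = id (x) tr_k: partial normalized trace M_N (x) M_k -> M_N."""
    return np.einsum('ipjp->ij', a.reshape(N, k, N, k)) / k

def h(y, w, N, k):
    """h_y(w) = E[(w (x) 1 - y)^{-1}]^{-1} - w."""
    resolvent = np.linalg.inv(np.kron(w, np.eye(k)) - y)
    return np.linalg.inv(expectation(resolvent, N, k)) - w

def random_hermitian(rng, d):
    g = rng.standard_normal((d, d)) + 1j * rng.standard_normal((d, d))
    return (g + g.conj().T) / 2

rng = np.random.default_rng(0)
N, k, eps = 2, 6, 0.3
y = random_hermitian(rng, N * k)               # selfadjoint element of M_N (x) M_k
norm_y = np.linalg.norm(y, 2)                  # operator norm of y
bound = 4 * norm_y * (1 + 2 * norm_y / eps)    # right-hand side of Lemma 2.3

worst = 0.0
for _ in range(200):
    g = rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N))
    v = eps * np.eye(N) + g @ g.conj().T       # imaginary part bounded below by eps * 1
    w = random_hermitian(rng, N) + 1j * v
    worst = max(worst, np.linalg.norm(h(y, w, N, k), 2))

print(worst, bound)
```

In this model the observed norms stay well below the bound, which is consistent with the lemma being a worst-case estimate.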

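We shall use this lemma to prove that for any fixed $b \in \mathbb{H}^+(B)$, $\Im b > \varepsilon 1$, there is an $m > 0$ depending on $b$, $x$ and $y$ so that

(a) $f_b$ maps $B(0,2m) \cap (\mathbb{H}^+(B) + i\frac{\varepsilon}{2}1) := \{w \in \mathbb{H}^+(B) + i\frac{\varepsilon}{2}1 : \|w\| < 2m\}$ into itself, and

(b) the inclusion is strict, in the (stronger) sense that

\[ \inf\left\{ \|u - v\| : u \in f_b\left(B(0,2m) \cap (\mathbb{H}^+(B) + i\tfrac{\varepsilon}{2}1)\right),\ v \in B \setminus \left(B(0,2m) \cap (\mathbb{H}^+(B) + i\tfrac{\varepsilon}{2}1)\right) \right\} > 0. \tag{20} \]

Fix a $b \in \mathbb{H}^+(B)$, and choose $\varepsilon > 0$ so that $\Im b > \varepsilon 1$. Then, by the above lemma,

\[ \sup_{w \in \mathbb{H}^+(B)} \|f_b(w)\| \le \|b\| + \sup_{w \in \mathbb{H}^+(B) + i\frac{\varepsilon}{2}1} \|h_y(w)\| \le \|b\| + 4\|y\|\left(1 + 4\varepsilon^{-1}\|y\|\right) < \infty. \]

We shall take

\[ m := \|b\| + 4\|y\|\left(1 + 4\varepsilon^{-1}\|y\|\right). \tag{21} \]

On the other hand, as mentioned above, $\Im h_y(w) \ge 0$ for all $w \in \mathbb{H}^+(B)$, so that $\Im f_b(w) \ge \Im b > \varepsilon 1 > \frac{\varepsilon}{2}1$. This guarantees that $\{w \in \mathbb{H}^+(B) + i\frac{\varepsilon}{2}1 : \|w\| < 2m\}$ is mapped by $f_b$ into itself, proving item (a) above. We shall give an explicit lower bound in order to show item (b): if $w_0 \notin \{w \in \mathbb{H}^+(B) + i\frac{\varepsilon}{2}1 : \|w\| < 2m\}$, then either $\|w_0\| \ge 2m$, and then $\|w_0 - f_b(w)\| \ge \|w_0\| - \|f_b(w)\| \ge m$ for all $w \in \mathbb{H}^+(B) + i\frac{\varepsilon}{2}1$, or $\Im w_0 \not> \frac{\varepsilon}{2}1$, and then$^4$

\[ \|f_b(w) - w_0\| \ge \|\Im f_b(w) - \Im w_0\| \ge \varepsilon/2. \]

We conclude that the distance between $f_b(\{w \in \mathbb{H}^+(B) + i\frac{\varepsilon}{2}1 : \|w\| < 2m\})$ and $B \setminus (\{w \in \mathbb{H}^+(B) + i\frac{\varepsilon}{2}1 : \|w\| < 2m\})$ is no less than $\min\{m, \varepsilon/2\}$, a strictly positive number.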

This allows us to apply the Earle-Hamilton Theorem [15, Theorem 11.1] to conclude that $f_b$ has a unique attracting fixed point $w_b$ in the set $\{w \in \mathbb{H}^+(B) + i\varepsilon 1 : \|w\| \le m\}$, with $m = \|b\| + 4\|y\|\left(1 + 4\varepsilon^{-1}\|y\|\right)$, constant provided by Lemma 2.3. According to this theorem, the map $f_b$ is a strict contraction in the Carathéodory metric (not necessarily in the norm metric); however, since the two metrics are comparable on sets strictly inside the given domain (i.e. at strictly positive distance from the complement of the domain), it follows that $\lim_{n\to\infty} f_b^{\circ n}(w) = w_b$ happens both in the Carathéodory and the norm metric whenever $w$ is chosen strictly inside the domain of $f_b$. Thus, as in the proof of [5, Theorem 2.2], it follows that the map $b \mapsto w_b$ is Gâteaux analytic, and, by its - global, not only local - boundedness on bounded domains in $\mathbb{H}^+(B) + i\frac{\varepsilon}{2}1$ for any $\varepsilon > 0$, it is Fréchet analytic - see [15]. Since this point $w_b$ is in $\mathbb{H}^+(B) + b$ whenever $\Im b > \varepsilon 1$, item (1) of our theorem is proved.
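In the scalar case $B = \mathbb{C}$ the fixed point iteration can be tried out directly. The sketch below assumes (this choice is ours, not the text's) that $x$ and $y$ are free standard semicircular elements, so that $x + y$ is semicircular with variance 2 and a closed formula is available for comparison. It iterates $w \mapsto h_y(h_x(w) + b) + b$ and then evaluates $G_x$ at the limit; the function names, iteration cap and tolerance are hypothetical.

```python
import numpy as np

def G_semicircle(z, var=1.0):
    """Cauchy transform of the centered semicircle law with variance var, for Im z > 0."""
    r = 2.0 * np.sqrt(var)
    # sqrt(z - r) * sqrt(z + r) selects the branch behaving like z at infinity
    return (z - np.sqrt(z - r) * np.sqrt(z + r)) / (2.0 * var)

def h(w):
    """h_x(w) = F_x(w) - w = 1 / G_x(w) - w for a standard semicircular x."""
    return 1.0 / G_semicircle(w) - w

def omega_1(b, iters=200, tol=1e-14):
    """Approximate the attracting fixed point of f_b(w) = h_y(h_x(w) + b) + b."""
    w = b
    for _ in range(iters):
        w_next = h(h(w) + b) + b
        if abs(w_next - w) < tol:
            return w_next
        w = w_next
    return w

b = 0.5 + 1.0j
w_b = omega_1(b)
G_sum = G_semicircle(w_b)                 # G_x(omega_1(b)), the candidate for G_{x+y}(b)
print(G_sum, G_semicircle(b, var=2.0))    # the two values should agree
```

The iteration is started at $w = b$; any other starting point in the upper half-plane should lead to the same limit.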

To conclude our proof, we shall show next that for elements $b$ as in the first part of our proof, $w_b = (o_1(b^{-1}))^{-1} = \omega_1(b)$. The uniqueness of analytic continuation will allow us to conclude. Denote by $w'_b$ the fixed point of the map $w \mapsto h_x(h_y(w) + b) + b$. If $w_b = h_y(h_x(w_b) + b) + b$ and $w'_b = h_x(h_y(w'_b) + b) + b$, then $h_x(w_b) + b = h_x(h_y(h_x(w_b) + b) + b) + b$, and, by the uniqueness of the fixed point in the upper half-plane, $h_x(w_b) + b = w'_b$. Similarly, $h_y(w'_b) + b = w_b$. Adding $w_b$ to the first and $w'_b$ to the second equation gives us $w_b + w'_b = b + F_x(w_b) = b + F_y(w'_b)$. Thus, the fixed points in question satisfy item (2) of our theorem. On the other hand, by equation (19), it follows that

\[ o_2(b)^{-1} = C_x(o_1(b))^{-1} + b^{-1} - o_1(b)^{-1} = h_x(o_1(b)^{-1}) + b^{-1} \]

(recall that $C_x(w)^{-1} = F_x(w^{-1})$) whenever $b \in \mathbb{H}^-(B)$ of small enough norm is so that $\Im o_1(b) < 0$ (which, as seen in (19), implies that $\Im o_2(b) < 0$). Writing now $C_x(o_1(b)) = C_y(o_2(b))$ gives us

\[ F_x(o_1(b)^{-1}) = C_x(o_1(b))^{-1} = C_y\left(\left(h_x(o_1(b)^{-1}) + b^{-1}\right)^{-1}\right)^{-1} = F_y\left(h_x(o_1(b)^{-1}) + b^{-1}\right). \]

Adding $o_1(b)^{-1}$ to both sides and recalling the definition of $h$ finally gives us

\[ o_1(b)^{-1} = h_y\left(h_x(o_1(b)^{-1}) + b^{-1}\right) + b^{-1}, \]

for $b \in \mathbb{H}^-(B)$ so that $\Im o_1(b) < 0$ - the existence of an open set on which this happens having been proved in the first part of our proof. We perform the change of variable $b \mapsto b^{-1}$ to conclude that the correspondence $b \mapsto o_1(b^{-1})^{-1}$ is a fixed point in $\mathbb{H}^+(B)$ for $f_b$, so, by uniqueness of the fixed point of $f_b$, must coincide with $w_b$. Since on this small open set, $b \mapsto o_1(b^{-1})^{-1} = \omega_1(b)$ satisfies also item (3), we conclude by analytic continuation that (3) holds for all $b \in \mathbb{H}^+(B)$. This concludes the proof. □



$^4$ If $a > \varepsilon 1$ and $b = b^* \not> \frac{\varepsilon}{2}1$ then (using the GNS representation of $B$ on a Hilbert space) there exists a vector $\xi$ of norm one so that $\langle b\xi | \xi \rangle \le \varepsilon \langle \xi | \xi \rangle / 2 = \varepsilon/2$. But $\langle a\xi | \xi \rangle > \varepsilon$, so $\langle (a - b)\xi | \xi \rangle \ge \varepsilon/2$, which implies $\|a - b\| \ge \varepsilon/2$.

3. Anderson's selfadjoint linearization trick



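Let $\mathcal{A}$ be a complex and unital *-algebra and let selfadjoint elements $x_1, \dots, x_n \in \mathcal{A}$ be given. Then, for any non-commutative polynomial $p \in \mathbb{C}\langle X_1, \dots, X_n \rangle$, we get an operator $P = p(x_1, \dots, x_n) \in \mathcal{A}$ by evaluating $p$ at $(x_1, \dots, x_n)$.

In this situation, knowing a "linearization trick" means to have a procedure that leads finally to an operator

\[ L_P = b_0 \otimes 1 + b_1 \otimes x_1 + \dots + b_n \otimes x_n \in M_N(\mathbb{C}) \otimes \mathcal{A} \]

for some matrices $b_0, \dots, b_n \in M_N(\mathbb{C})$ of dimension $N$, such that $z - P$ is invertible in $\mathcal{A}$ if and only if $\Lambda(z) - L_P$ is invertible in $M_N(\mathbb{C}) \otimes \mathcal{A}$. Hereby, we put

\[ \Lambda(z) = \begin{pmatrix} z & 0 & \cdots & 0 \\ 0 & 0 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & 0 \end{pmatrix} \quad \text{for all } z \in \mathbb{C}. \tag{22} \]

As we will see in the following, the linearization in terms of the dimension $N \in \mathbb{N}$ and the matrices $b_0, \dots, b_n \in M_N(\mathbb{C})$ usually depends only on the given polynomial $p \in \mathbb{C}\langle X_1, \dots, X_n \rangle$ and not on the special choice of elements $x_1, \dots, x_n \in \mathcal{A}$.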

The first famous linearization trick goes back to Haagerup and Thorbjørnsen [19, 20] and turned out to be a powerful tool in many different respects. However, there was the disadvantage that, even if we start from a selfadjoint polynomial operator $P$, in general, we will not end up with a linearization $L_P$ which is selfadjoint as well. Then, in [2] and [3], Anderson presented a new version of this linearization procedure, which was able to preserve selfadjointness.

For the reader's convenience and since this will be one of our main tools, we recall here the main facts about Anderson's selfadjoint linearization trick, streamlined to our needs.



Definition 3.1. Let $p \in \mathbb{C}\langle X_1, \dots, X_n \rangle$ be given. A matrix

\[ L_p := \begin{pmatrix} 0 & u \\ v & Q \end{pmatrix} \in M_N(\mathbb{C}) \otimes \mathbb{C}\langle X_1, \dots, X_n \rangle, \]

where

• $N \in \mathbb{N}$ is an integer,

• $Q \in M_{N-1}(\mathbb{C}) \otimes \mathbb{C}\langle X_1, \dots, X_n \rangle$ is invertible

• and $u$ is a row vector and $v$ is a column vector, both of size $N - 1$ with entries in $\mathbb{C}\langle X_1, \dots, X_n \rangle$,

is called a linearization of $p$, if the following conditions are satisfied:

(i) There are matrices $b_0, \dots, b_n \in M_N(\mathbb{C})$, such that

\[ L_p = b_0 \otimes 1 + b_1 \otimes X_1 + \dots + b_n \otimes X_n, \]

i.e. the polynomial entries in $Q$, $u$ and $v$ all have degree $\le 1$.

(ii) It holds true that

\[ p = -uQ^{-1}v. \]

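From condition (i) it seems natural to call such an operator $L_p$ a linearization of $p$. But at first glance, there seems to be no reason for assuming additionally a certain block structure of $L_p$ and moreover condition (ii). The following well-known result about Schur complements will make this clear.

Proposition 3.2. Let $\mathcal{A}$ be a complex and unital algebra and let elements $a, b, c, d \in \mathcal{A}$ be given. We assume that $d$ is invertible in $\mathcal{A}$. Then the following statements are equivalent:

(i) The matrix $\begin{pmatrix} a & b \\ c & d \end{pmatrix}$ is invertible in $M_2(\mathbb{C}) \otimes \mathcal{A}$.

(ii) The Schur complement $a - bd^{-1}c$ is invertible in $\mathcal{A}$.

If the equivalent conditions (i) and (ii) are satisfied, we have the relation

\[ \begin{pmatrix} a & b \\ c & d \end{pmatrix}^{-1} = \begin{pmatrix} 0 & 0 \\ 0 & d^{-1} \end{pmatrix} + \begin{pmatrix} 1 \\ -d^{-1}c \end{pmatrix} (a - bd^{-1}c)^{-1} \begin{pmatrix} 1 & -bd^{-1} \end{pmatrix}. \tag{23} \]

Proof. A direct calculation shows that

\[ \begin{pmatrix} a & b \\ c & d \end{pmatrix} = \begin{pmatrix} 1 & bd^{-1} \\ 0 & 1 \end{pmatrix} \begin{pmatrix} a - bd^{-1}c & 0 \\ 0 & d \end{pmatrix} \begin{pmatrix} 1 & 0 \\ d^{-1}c & 1 \end{pmatrix} \tag{24} \]

holds. Since the matrices

\[ \begin{pmatrix} 1 & bd^{-1} \\ 0 & 1 \end{pmatrix} \quad \text{and} \quad \begin{pmatrix} 1 & 0 \\ d^{-1}c & 1 \end{pmatrix} \]

are both invertible in $M_2(\mathbb{C}) \otimes \mathcal{A}$, the stated equivalence of (i) and (ii) immediately follows from (24). Moreover, if (i) and (ii) are satisfied, (24) leads to

\[ \begin{pmatrix} a & b \\ c & d \end{pmatrix}^{-1} = \begin{pmatrix} 1 & 0 \\ -d^{-1}c & 1 \end{pmatrix} \begin{pmatrix} (a - bd^{-1}c)^{-1} & 0 \\ 0 & d^{-1} \end{pmatrix} \begin{pmatrix} 1 & -bd^{-1} \\ 0 & 1 \end{pmatrix}, \]

from which (23) directly follows. □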
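Relation (23) is easy to confirm numerically. The sketch below takes $\mathcal{A} = M_3(\mathbb{C})$ with randomly chosen entries (an assumption made only for illustration; the diagonal shifts are there to keep $d$ and the Schur complement well conditioned) and compares the inverse of the block matrix with the right-hand side of (23).

```python
import numpy as np

rng = np.random.default_rng(1)
k = 3
a = rng.standard_normal((k, k)) + 5.0 * np.eye(k)
b = 0.5 * rng.standard_normal((k, k))
c = 0.5 * rng.standard_normal((k, k))
d = rng.standard_normal((k, k)) + 5.0 * np.eye(k)   # shift keeps d comfortably invertible

d_inv = np.linalg.inv(d)
schur = a - b @ d_inv @ c                            # Schur complement a - b d^{-1} c

full = np.block([[a, b], [c, d]])
zero = np.zeros((k, k))

# Right-hand side of (23): diag(0, d^{-1}) + column * schur^{-1} * row
col = np.vstack([np.eye(k), -d_inv @ c])
row = np.hstack([np.eye(k), -b @ d_inv])
rhs = np.block([[zero, zero], [zero, d_inv]]) + col @ np.linalg.inv(schur) @ row

print(np.allclose(np.linalg.inv(full), rhs))
```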



The following corollary is a direct consequence of Proposition 3.2 and explains why we need the additional assumption on the block structure made in part (ii) of Definition 3.1.



Corollary 3.3. Let $\mathcal{A}$ be a unital and complex algebra and let elements $x_1, \dots, x_n \in \mathcal{A}$ be given. For any polynomial $p \in \mathbb{C}\langle X_1, \dots, X_n \rangle$ that has a linearization

\[ L_p = b_0 \otimes 1 + b_1 \otimes X_1 + \dots + b_n \otimes X_n \in M_N(\mathbb{C}) \otimes \mathbb{C}\langle X_1, \dots, X_n \rangle \]

with matrices $b_0, \dots, b_n \in M_N(\mathbb{C})$, the following conditions are equivalent for any complex number $z \in \mathbb{C}$:

(i) The operator $z - P$ with $P := p(x_1, \dots, x_n)$ is invertible in $\mathcal{A}$.

(ii) The operator $\Lambda(z) - L_P$ with $\Lambda(z)$ defined as in (22) and

\[ L_P := b_0 \otimes 1 + b_1 \otimes x_1 + \dots + b_n \otimes x_n \in M_N(\mathbb{C}) \otimes \mathcal{A} \]

is invertible in $M_N(\mathbb{C}) \otimes \mathcal{A}$.

Moreover, if (i) and (ii) are fulfilled for some $z \in \mathbb{C}$, we have that

\[ \left[(\Lambda(z) - L_P)^{-1}\right]_{1,1} = (z - P)^{-1}. \]

Proof. For all $k \in \mathbb{N}$, we can consider the evaluation homomorphism

\[ \Phi^{(k)}\colon M_k(\mathbb{C}) \otimes \mathbb{C}\langle X_1, \dots, X_n \rangle \to M_k(\mathbb{C}) \otimes \mathcal{A}, \quad q \mapsto q(x_1, \dots, x_n). \]

Since invertible elements in $M_k(\mathbb{C}) \otimes \mathbb{C}\langle X_1, \dots, X_n \rangle$ are mapped by $\Phi^{(k)}$ to invertible elements in $M_k(\mathbb{C}) \otimes \mathcal{A}$, we see that $L_P = \Phi^{(N)}(L_p)$ admits due to Definition 3.1 a block decomposition of the form

\[ L_P = \begin{pmatrix} 0 & u \\ v & Q \end{pmatrix} \in M_N(\mathbb{C}) \otimes \mathcal{A} \]

where $Q \in M_{N-1}(\mathbb{C}) \otimes \mathcal{A}$ is invertible and $P = -uQ^{-1}v$ holds. Hence, by using Proposition 3.2 we can deduce that for any $z \in \mathbb{C}$ the matrix

\[ \Lambda(z) - L_P = \begin{pmatrix} z & -u \\ -v & -Q \end{pmatrix} \]

is invertible in $M_N(\mathbb{C}) \otimes \mathcal{A}$ if and only if its Schur complement

\[ z + uQ^{-1}v = z - P \]

is invertible in $\mathcal{A}$. This proves the stated equivalence of (i) and (ii). Moreover, we may deduce from (23) that in this case

\[ \left[(\Lambda(z) - L_P)^{-1}\right]_{1,1} = (z - P)^{-1} \]

holds. This completes the proof. □
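The identity of Corollary 3.3 can be observed on concrete matrices. The sketch below assumes $\mathcal{A} = M_k(\mathbb{C})$ with two random Hermitian matrices $x_1, x_2$ and uses the $3 \times 3$ linearization of the anticommutator $X_1X_2 + X_2X_1$ that appears in Example 5.1; the sampling routine and the sizes are hypothetical choices.

```python
import numpy as np

def random_hermitian(rng, k):
    g = rng.standard_normal((k, k)) + 1j * rng.standard_normal((k, k))
    return (g + g.conj().T) / np.sqrt(4 * k)

rng = np.random.default_rng(2)
k = 5
x1, x2 = random_hermitian(rng, k), random_hermitian(rng, k)
I = np.eye(k)

# L_p = [[0, X1, X2], [X1, 0, -1], [X2, -1, 0]] = b0 (x) 1 + b1 (x) X1 + b2 (x) X2
b0 = np.array([[0, 0, 0], [0, 0, -1], [0, -1, 0]], dtype=complex)
b1 = np.array([[0, 1, 0], [1, 0, 0], [0, 0, 0]], dtype=complex)
b2 = np.array([[0, 0, 1], [0, 0, 0], [1, 0, 0]], dtype=complex)

L_P = np.kron(b0, I) + np.kron(b1, x1) + np.kron(b2, x2)
P = x1 @ x2 + x2 @ x1

z = 0.3 + 0.7j
Lam = np.kron(np.diag([z, 0, 0]), I)             # Lambda(z) (x) 1
corner = np.linalg.inv(Lam - L_P)[:k, :k]        # (1,1) block of the inverse
resolvent = np.linalg.inv(z * I - P)

print(np.allclose(corner, resolvent))
```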

Now, it only remains to ensure the existence of linearizations of this kind. 

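Proposition 3.4. Any polynomial $p \in \mathbb{C}\langle X_1, \dots, X_n \rangle$ admits a linearization $L_p$ in the sense of Definition 3.1.

Proof. Since each monomial $X_j \in \mathbb{C}\langle X_1, \dots, X_n \rangle$ has the linearization

\[ L_{X_j} = \begin{pmatrix} 0 & X_j \\ 1 & -1 \end{pmatrix} \in M_2(\mathbb{C}) \otimes \mathbb{C}\langle X_1, \dots, X_n \rangle, \]

and all polynomials of the form

\[ p := X_{i_1} X_{i_2} \cdots X_{i_k} \in \mathbb{C}\langle X_1, \dots, X_n \rangle \quad \text{for } k \ge 2,\ i_1, \dots, i_k \in \{1, \dots, n\}, \]

have the linearizations

\[ L_p = \begin{pmatrix} & & & X_{i_1} \\ & & X_{i_2} & -1 \\ & ⋰ & ⋰ & \\ X_{i_k} & -1 & & \end{pmatrix} \in M_k(\mathbb{C}) \otimes \mathbb{C}\langle X_1, \dots, X_n \rangle, \]

we just have to observe the following: If the polynomials $p_1, \dots, p_k \in \mathbb{C}\langle X_1, \dots, X_n \rangle$ have linearizations

\[ L_{p_j} = \begin{pmatrix} 0 & u_j \\ v_j & Q_j \end{pmatrix} \in M_{N_j}(\mathbb{C}) \otimes \mathbb{C}\langle X_1, \dots, X_n \rangle \]

for $j = 1, \dots, k$, then their sum $p := p_1 + \dots + p_k$ has the linearization

\[ L_p = \begin{pmatrix} 0 & u_1 & \cdots & u_k \\ v_1 & Q_1 & & \\ \vdots & & \ddots & \\ v_k & & & Q_k \end{pmatrix} \in M_N(\mathbb{C}) \otimes \mathbb{C}\langle X_1, \dots, X_n \rangle \]

with $N := (N_1 + \dots + N_k) - k + 1$. □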
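The construction in this proof is algorithmic, and a small sketch may make the bookkeeping concrete. The code below is an illustration under our own conventions, not the authors' implementation: a linearization is stored as the list of coefficient matrices $[b_0, \dots, b_n]$, monomials are given as tuples of indices in $\{1, \dots, n\}$, and all function names are hypothetical. Condition (ii) of Definition 3.1 is then checked by evaluating at random matrices.

```python
import numpy as np

def monomial_linearization(word, n):
    """Coefficient matrices [b_0, ..., b_n] for the monomial X_{word[0]} ... X_{word[-1]}."""
    k = len(word)
    if k == 1:                                   # [[0, X_j], [1, -1]]
        b = [np.zeros((2, 2)) for _ in range(n + 1)]
        b[word[0]][0, 1] = 1.0
        b[0][1, 0], b[0][1, 1] = 1.0, -1.0
        return b
    b = [np.zeros((k, k)) for _ in range(n + 1)]
    for r, i in enumerate(word):                 # variables sit on the anti-diagonal
        b[i][r, k - 1 - r] = 1.0
    for r in range(1, k):                        # -1 just to the right of the anti-diagonal
        b[0][r, k - r] = -1.0
    return b

def sum_linearization(parts):
    """Linearization of p_1 + ... + p_k from linearizations of the p_j."""
    n1 = len(parts[0])
    N = sum(b[0].shape[0] for b in parts) - len(parts) + 1
    out = [np.zeros((N, N)) for _ in range(n1)]
    pos = 1
    for b in parts:
        m = b[0].shape[0] - 1
        for j in range(n1):
            out[j][0, pos:pos + m] = b[j][0, 1:]                # u_j
            out[j][pos:pos + m, 0] = b[j][1:, 0]                # v_j
            out[j][pos:pos + m, pos:pos + m] = b[j][1:, 1:]     # Q_j
        pos += m
    return out

def compressed_value(b, xs):
    """Evaluate the linearization at matrices xs and return -u Q^{-1} v."""
    k = xs[0].shape[0]
    L = np.kron(b[0], np.eye(k))
    for bj, xj in zip(b[1:], xs):
        L = L + np.kron(bj, xj)
    u, v, Q = L[:k, k:], L[k:, :k], L[k:, k:]
    return -u @ np.linalg.solve(Q, v)

rng = np.random.default_rng(3)
xs = [rng.standard_normal((4, 4)) for _ in range(3)]
x1, x2, x3 = xs
words = [(1, 2, 1), (2, 3, 2), (3, 1, 3)]
b = sum_linearization([monomial_linearization(w, 3) for w in words])

p_direct = x1 @ x2 @ x1 + x2 @ x3 @ x2 + x3 @ x1 @ x3
p_from_L = compressed_value(b, xs)
print(b[0].shape, np.allclose(p_direct, p_from_L))
```

For the three monomials of degree 3 used here, the resulting dimension is $N = (3 + 3 + 3) - 3 + 1 = 7$, in line with the formula for $N$ in the proof.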



It is a nice feature that we can extend this algorithm to preserve selfadjointness. Note that $\mathbb{C}\langle X_1, \dots, X_n \rangle$ becomes a *-algebra by anti-linear extension of

\[ (X_{i_1} X_{i_2} \cdots X_{i_k})^* := X_{i_k} \cdots X_{i_2} X_{i_1} \quad \text{for } i_1, \dots, i_k \in \{1, \dots, n\}. \]

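Corollary 3.5. Let $p \in \mathbb{C}\langle X_1, \dots, X_n \rangle$ be a selfadjoint polynomial. Then $p$ admits a selfadjoint linearization $L_p$ in the sense of Definition 3.1.

Proof. Since $p$ is selfadjoint, we have $p = q + q^*$ for $q := \frac{p}{2}$. Let

\[ L_q = \begin{pmatrix} 0 & u \\ v & Q \end{pmatrix} \in M_N(\mathbb{C}) \otimes \mathbb{C}\langle X_1, \dots, X_n \rangle \]

be a linearization of $q$. Then

\[ L_p = \begin{pmatrix} 0 & u & v^* \\ u^* & 0 & Q^* \\ v & Q & 0 \end{pmatrix} \in M_{2N-1}(\mathbb{C}) \otimes \mathbb{C}\langle X_1, \dots, X_n \rangle \]

gives a selfadjoint linearization of $p$. Indeed, condition (i) of Definition 3.1 is obviously fulfilled and, since

\[ -\begin{pmatrix} u & v^* \end{pmatrix} \begin{pmatrix} 0 & Q^* \\ Q & 0 \end{pmatrix}^{-1} \begin{pmatrix} u^* \\ v \end{pmatrix} = -\begin{pmatrix} u & v^* \end{pmatrix} \begin{pmatrix} 0 & Q^{-1} \\ (Q^*)^{-1} & 0 \end{pmatrix} \begin{pmatrix} u^* \\ v \end{pmatrix} = -uQ^{-1}v - (uQ^{-1}v)^* = q + q^* = p, \]

condition (ii) of Definition 3.1 holds as well. □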
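A short numerical illustration of this symmetrization follows. As a simplifying assumption it starts from $q = x_1 x_2$ with linearization $\begin{pmatrix} 0 & x_1 \\ x_2 & -1 \end{pmatrix}$ rather than from $q = p/2$; the computation in the proof only uses $p = q + q^*$, so this yields the anticommutator $p = x_1x_2 + x_2x_1$ for Hermitian $x_1, x_2$. Names and sizes are hypothetical.

```python
import numpy as np

def random_hermitian(rng, k):
    g = rng.standard_normal((k, k)) + 1j * rng.standard_normal((k, k))
    return (g + g.conj().T) / np.sqrt(4 * k)

def star(m):
    """Adjoint of a matrix."""
    return m.conj().T

rng = np.random.default_rng(4)
k = 4
x1, x2 = random_hermitian(rng, k), random_hermitian(rng, k)
I, O = np.eye(k, dtype=complex), np.zeros((k, k), dtype=complex)

# Linearization of q = x1 x2: u = x1, v = x2, Q = -1
u, v, Q = x1, x2, -I

# Selfadjoint linearization of p = q + q^*
L_p = np.block([[O,       u, star(v)],
                [star(u), O, star(Q)],
                [v,       Q, O      ]])

# Condition (ii): -[u v^*] [[0, Q^*], [Q, 0]]^{-1} [u^*; v] should equal p
U = np.hstack([u, star(v)])
V = np.vstack([star(u), v])
M = np.block([[O, star(Q)], [Q, O]])
p_from_L = -U @ np.linalg.solve(M, V)
p_direct = x1 @ x2 + x2 @ x1

print(np.allclose(L_p, star(L_p)), np.allclose(p_from_L, p_direct))
```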



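We conclude with the following corollary, which will enable us to shift $\Lambda(z)$ for $z \in \mathbb{C}^+$ to a point

\[ \Lambda_\varepsilon(z) := \begin{pmatrix} z & & & \\ & i\varepsilon & & \\ & & \ddots & \\ & & & i\varepsilon \end{pmatrix} \]

lying inside the domain $\mathbb{H}^+(M_N(\mathbb{C}))$ in order to get access to all analytic tools that are available there.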

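Corollary 3.6. Let $(\mathcal{A}, \phi)$ be a C*-probability space and let elements $x_1, \dots, x_n \in \mathcal{A}$ be given. For any selfadjoint polynomial $p \in \mathbb{C}\langle X_1, \dots, X_n \rangle$ that has a selfadjoint linearization

\[ L_p = b_0 \otimes 1 + b_1 \otimes X_1 + \dots + b_n \otimes X_n \in M_N(\mathbb{C}) \otimes \mathbb{C}\langle X_1, \dots, X_n \rangle \]

with matrices $b_0, \dots, b_n \in M_N(\mathbb{C})_{sa}$, we put $P := p(x_1, \dots, x_n)$ and

\[ L_P := b_0 \otimes 1 + b_1 \otimes x_1 + \dots + b_n \otimes x_n \in M_N(\mathbb{C}) \otimes \mathcal{A}. \]

Then, for each $z \in \mathbb{C}^+$ and all sufficiently small $\varepsilon > 0$, the operators $z - P \in \mathcal{A}$ and $\Lambda_\varepsilon(z) - L_P \in M_N(\mathbb{C}) \otimes \mathcal{A}$ are both invertible and we have

\[ \lim_{\varepsilon \searrow 0} \left[ E\left((\Lambda_\varepsilon(z) - L_P)^{-1}\right) \right]_{1,1} = G_P(z). \]

Hereby, $E\colon M_N(\mathbb{C}) \otimes \mathcal{A} \to M_N(\mathbb{C})$ denotes the conditional expectation given by $E := \mathrm{id}_{M_N(\mathbb{C})} \otimes \phi$.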

Proof. It is an easy observation that the matrix valued resolvent set of $L_P$, namely the set $\Omega$ of all matrices $b \in M_N(\mathbb{C})$ for which $b - L_P$ is invertible in $M_N(\mathbb{C}) \otimes \mathcal{A}$, is an open subset of $M_N(\mathbb{C})$.

Since $P$ is selfadjoint, $z - P$ is invertible for all $z \in \mathbb{C}^+$. Hence, it follows from Corollary 3.3 that $\Lambda(z) \in \Omega$ for all $z \in \mathbb{C}^+$. This implies that the function

\[ G\colon \Omega \to M_N(\mathbb{C}), \quad b \mapsto E\left[(b - L_P)^{-1}\right] \]

is holomorphic (and particularly continuous) in a neighborhood of $\Lambda(z)$ for all $z \in \mathbb{C}^+$. This shows the invertibility of $\Lambda_\varepsilon(z) - L_P$ for all $z \in \mathbb{C}^+$ and sufficiently small $\varepsilon > 0$. By using Corollary 3.3 again, we observe

\[ \left[ E\left[(\Lambda(z) - L_P)^{-1}\right] \right]_{1,1} = \phi\left( \left[(\Lambda(z) - L_P)^{-1}\right]_{1,1} \right) = \phi\left((z - P)^{-1}\right) = G_P(z). \]

Thus we get that

\[ \lim_{\varepsilon \searrow 0} \left[ E\left((\Lambda_\varepsilon(z) - L_P)^{-1}\right) \right]_{1,1} = \lim_{\varepsilon \searrow 0} \left[ G(\Lambda_\varepsilon(z)) \right]_{1,1} = \left[ G(\Lambda(z)) \right]_{1,1} = G_P(z) \]

holds as claimed. □
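The convergence in Corollary 3.6 can be watched in a matrix model. The sketch below assumes $\mathcal{A} = M_k(\mathbb{C})$ with $\phi$ the normalized trace, two random Hermitian matrices, and the anticommutator linearization of Example 5.1; these choices and the names used are illustrative only.

```python
import numpy as np

def random_hermitian(rng, k):
    g = rng.standard_normal((k, k)) + 1j * rng.standard_normal((k, k))
    return (g + g.conj().T) / np.sqrt(4 * k)

rng = np.random.default_rng(5)
k = 20
x1, x2 = random_hermitian(rng, k), random_hermitian(rng, k)
I = np.eye(k)

b0 = np.array([[0, 0, 0], [0, 0, -1], [0, -1, 0]], dtype=complex)
b1 = np.array([[0, 1, 0], [1, 0, 0], [0, 0, 0]], dtype=complex)
b2 = np.array([[0, 0, 1], [0, 0, 0], [1, 0, 0]], dtype=complex)
L_P = np.kron(b0, I) + np.kron(b1, x1) + np.kron(b2, x2)
P = x1 @ x2 + x2 @ x1

z = 0.2 + 0.7j

def corner_expectation(eps):
    """[E((Lambda_eps(z) - L_P)^{-1})]_{1,1} with E = id (x) normalized trace."""
    Lam_eps = np.kron(np.diag([z, 1j * eps, 1j * eps]), I)
    R = np.linalg.inv(Lam_eps - L_P)
    return np.trace(R[:k, :k]) / k

G_P = np.trace(np.linalg.inv(z * I - P)) / k      # phi((z - P)^{-1})
errors = [abs(corner_expectation(e) - G_P) for e in (1e-1, 1e-3, 1e-5)]
print(errors)
```

The printed errors should shrink as $\varepsilon$ decreases, in line with the continuity argument in the proof.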

4. The algorithmic solution of Problems 1 and 2 

As already mentioned in the introduction, the subordination result for the operator- 
valued free additive convolution stated in Section 2 provides in combination with 
Anderson's version of the linearization trick presented in Section 3 an algorithm 
to calculate the distribution of any selfadjoint polynomial in selfadjoint elements 
which are freely independent. The following theorem gives the precise statement. 

Theorem 4.1. Let $(\mathcal{A}, \phi)$ be a non-commutative C*-probability space, $x_1, \dots, x_n \in \mathcal{A}$ selfadjoint elements which are freely independent, and $p \in \mathbb{C}\langle X_1, \dots, X_n \rangle$ a selfadjoint polynomial in $n$ non-commuting variables $X_1, \dots, X_n$. We put $P := p(x_1, \dots, x_n)$. The following procedure leads to the distribution of $P$.

Step 1: According to Corollary 3.5, $p$ has a selfadjoint linearization

\[ L_p = b_0 \otimes 1 + b_1 \otimes X_1 + \dots + b_n \otimes X_n \]

with matrices $b_0, b_1, \dots, b_n \in M_N(\mathbb{C})_{sa}$ of dimension $N \in \mathbb{N}$. We put

\[ L_P := b_0 \otimes 1 + b_1 \otimes x_1 + \dots + b_n \otimes x_n \in M_N(\mathbb{C}) \otimes \mathcal{A}. \]

Step 2: The operators $b_1 \otimes x_1, \dots, b_n \otimes x_n$ are freely independent elements in the operator-valued C*-probability space $(M_N(\mathbb{C}) \otimes \mathcal{A}, E)$, where $E\colon M_N(\mathbb{C}) \otimes \mathcal{A} \to M_N(\mathbb{C})$ denotes the conditional expectation given by $E := \mathrm{id}_{M_N(\mathbb{C})} \otimes \phi$. Furthermore, for $j = 1, \dots, n$, the $M_N(\mathbb{C})$-valued Cauchy transform $G_{b_j \otimes x_j}$ is completely determined by the scalar-valued Cauchy transforms $G_{x_j}$ via

\[ G_{b_j \otimes x_j}(b) = \lim_{\varepsilon \to 0} -\frac{1}{\pi} \int_{\mathbb{R}} (b - t b_j)^{-1}\, \Im\left(G_{x_j}(t + i\varepsilon)\right) dt \]

for all $b \in \mathbb{H}^+(M_N(\mathbb{C}))$.

Step 3: Due to Step 2, we can calculate the Cauchy transform of

\[ L_P - b_0 \otimes 1 = b_1 \otimes x_1 + \dots + b_n \otimes x_n \]

by using the fixed point iteration for the operator-valued free additive convolution. The Cauchy transform of $L_P$ is then given by

\[ G_{L_P}(b) = G_{L_P - b_0 \otimes 1}(b - b_0) \quad \text{for all } b \in \mathbb{H}^+(M_N(\mathbb{C})). \]

Step 4: Corollary 3.6 tells us that the scalar-valued Cauchy transform $G_P$ of $P$ is determined by

\[ G_P(z) = \lim_{\varepsilon \searrow 0} \left[ G_{L_P}(\Lambda_\varepsilon(z)) \right]_{1,1} \quad \text{for all } z \in \mathbb{C}^+. \]

Finally, we obtain the desired distribution of $P$ by applying the Stieltjes inversion formula.

Based on the results of the previous sections, there is almost nothing left to prove here. Only the two statements in Step 2 might require an explanation. Firstly, the fact that scalar-valued freeness between the entries of matrices $A_1, \dots, A_n \in M_N(\mathcal{A})$ implies freeness over $M_N(\mathbb{C})$ of the matrices themselves is a direct consequence of the definition of freeness. Secondly, the validity of the formula for the operator-valued Cauchy transform of $b_j \otimes x_j$ in terms of the scalar-valued Cauchy transform $G_{x_j}$ can be seen as follows: Let $\mu_j$ be the distribution of $x_j$. It is an easy observation that for all $b \in \mathbb{H}^+(M_N(\mathbb{C}))$ the formula

\[ E\left[(b - b_j \otimes x_j)^{-1}\right] = \int_{\mathbb{R}} (b - t b_j)^{-1}\, d\mu_j(t) \]

holds. Hereby, the integral on the right hand side is defined entrywise, which corresponds in this case to the definition of the well-known Bochner integral. By using the Stieltjes inversion theorem, this leads us to

\[ E\left[(b - b_j \otimes x_j)^{-1}\right] = \lim_{\varepsilon \to 0} -\frac{1}{\pi} \int_{\mathbb{R}} (b - t b_j)^{-1}\, \Im\left(G_{x_j}(t + i\varepsilon)\right) dt. \]

Particularly, this means that knowing the Cauchy transform $G_{x_j}$ of $x_j$ suffices to calculate the matrix-valued Cauchy transform of $b_j \otimes x_j$.
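The integral formula for $E[(b - b_j \otimes x_j)^{-1}]$ becomes a finite sum when $x_j$ is a Hermitian matrix and $\phi$ is the normalized trace, since $\mu_j$ is then the empirical eigenvalue distribution. The following sketch checks this special case; the matrix model, the sizes and the helper names are assumptions for illustration.

```python
import numpy as np

def random_hermitian(rng, d, scale=1.0):
    g = rng.standard_normal((d, d)) + 1j * rng.standard_normal((d, d))
    return scale * (g + g.conj().T) / 2

def expectation(a, N, k):
    """E = id (x) tr_k: partial normalized trace M_N (x) M_k -> M_N."""
    return np.einsum('ipjp->ij', a.reshape(N, k, N, k)) / k

rng = np.random.default_rng(6)
N, k = 3, 8
x = random_hermitian(rng, k, scale=1 / np.sqrt(k))   # plays the role of x_j
b_j = random_hermitian(rng, N)                        # selfadjoint coefficient matrix

g = rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N))
b = random_hermitian(rng, N) + 1j * (0.5 * np.eye(N) + g @ g.conj().T)   # b in H^+(M_N)

# Left-hand side: E[(b (x) 1 - b_j (x) x)^{-1}]
lhs = expectation(np.linalg.inv(np.kron(b, np.eye(k)) - np.kron(b_j, x)), N, k)

# Right-hand side: integral of (b - t b_j)^{-1} against the eigenvalue distribution of x
eigenvalues = np.linalg.eigvalsh(x)
rhs = sum(np.linalg.inv(b - t * b_j) for t in eigenvalues) / k

print(np.allclose(lhs, rhs))
```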

5. Examples 

In this section we will present a few examples of different polynomials to show the power and universality of our results. We will in all those cases compare the distribution arising from our algorithm with histograms of eigenvalue distributions of corresponding random matrices. For the random matrices we will choose either Gaussian or Wishart random matrices; their limit distribution is almost surely given by the semicircle law and the Marchenko-Pastur (aka free Poisson) distribution, respectively, and independent choices of such matrices will almost surely be asymptotically free (with respect to the normalized trace on the matrices); see, for example, [25, 22, 28].

Example 5.1 (The anticommutator). We consider the non-commutative polynomial $p \in \mathbb{C}\langle X_1, X_2 \rangle$ given by

\[ p(X_1, X_2) = X_1X_2 + X_2X_1. \]

It is easy to check that

\[ L_p = \begin{pmatrix} 0 & X_1 & X_2 \\ X_1 & 0 & -1 \\ X_2 & -1 & 0 \end{pmatrix} \]

is a selfadjoint linearization of $p$ in the sense of Definition 3.1.



Now, let $s_1, s_2$ be free semicircular elements in a non-commutative C*-probability space $(\mathcal{A}, \phi)$. Based on the algorithm of Theorem 4.1 we can calculate the distribution of the anticommutator $p(s_1, s_2) = s_1s_2 + s_2s_1$.

Consider, on the other hand, two sequences $(X_1^{(n)})_{n \in \mathbb{N}}$ and $(X_2^{(n)})_{n \in \mathbb{N}}$ of independent standard Gaussian random matrices. For each $n \in \mathbb{N}$, we consider the empirical eigenvalue distribution of the anticommutator

\[ p(X_1^{(n)}, X_2^{(n)}) = X_1^{(n)}X_2^{(n)} + X_2^{(n)}X_1^{(n)}. \]

By the asymptotic freeness results of Voiculescu, the eigenvalue distribution of those matrix anticommutators converges, as $n$ tends to infinity, to the distribution of the anticommutator $s_1s_2 + s_2s_1$. Fig. 1 compares the eigenvalue distribution of $p(X_1^{(n)}, X_2^{(n)})$, for $n = 4000$, with the result of our algorithm for the distribution of $p(s_1, s_2)$.

Figure 1. Comparison of the distribution of $p(s_1, s_2)$, according to our algorithm, with the histogram of eigenvalues for $p(X_1^{(n)}, X_2^{(n)})$, for $n = 4000$; here $p$ is the free anticommutator.
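The four steps of Theorem 4.1 can be sketched numerically for the anticommutator of two free standard semicircular elements. This is an illustrative sketch rather than the authors' implementation: the quadrature rule (Gauss-Chebyshev of the second kind, exploiting the explicit semicircle density instead of the limit formula of Step 2), the number of nodes `m`, the iteration cap and tolerance, and the finite values of `eps` and `eta` standing in for the limits are all assumptions made here.

```python
import numpy as np

# Step 1: L_p = [[0, X1, X2], [X1, 0, -1], [X2, -1, 0]]
b0 = np.array([[0, 0, 0], [0, 0, -1], [0, -1, 0]], dtype=complex)
b1 = np.array([[0, 1, 0], [1, 0, 0], [0, 0, 0]], dtype=complex)
b2 = np.array([[0, 0, 1], [0, 0, 0], [1, 0, 0]], dtype=complex)

def G_semicircular(b_j, w, m=400):
    """Step 2: integral of (w - t b_j)^{-1} against the semicircle law on [-2, 2],
    via Gauss-Chebyshev quadrature of the second kind (the weights sum to 1)."""
    theta = np.arange(1, m + 1) * np.pi / (m + 1)
    t = 2.0 * np.cos(theta)
    weights = 2.0 * np.sin(theta) ** 2 / (m + 1)
    resolvents = np.linalg.inv(w[None, :, :] - t[:, None, None] * b_j[None, :, :])
    return np.einsum('i,ijk->jk', weights, resolvents)

def h(b_j, w):
    """h(w) = G(w)^{-1} - w for the element b_j (x) s_j."""
    return np.linalg.inv(G_semicircular(b_j, w)) - w

def G_L(b, iters=500, tol=1e-12):
    """Step 3: Cauchy transform of L_P at b via the fixed point iteration
    w -> h_2(h_1(w) + c) + c with c = b - b0."""
    c = b - b0
    w = c.copy()
    for _ in range(iters):
        w_next = h(b2, h(b1, w) + c) + c
        converged = np.linalg.norm(w_next - w) < tol
        w = w_next
        if converged:
            break
    return G_semicircular(b1, w)

def G_P(z, eps):
    """Step 4 with a fixed eps > 0 instead of the limit eps -> 0."""
    return G_L(np.diag([z, 1j * eps, 1j * eps]))[0, 0]

def density(x, eps=0.05, eta=0.05):
    """Stieltjes inversion at height eta above the real axis (a smoothed density)."""
    return -G_P(x + 1j * eta, eps).imag / np.pi

print(G_P(0.5 + 1.0j, 0.5))
```

Smaller values of `eps` and `eta` give sharper approximations but require more quadrature nodes and more iterations, since the integrand and the iteration both degrade as the arguments approach the real axis.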

Example 5.2 (Perturbed anticommutator). In the same way, we can deal with the following variation of the anticommutator:

\[ p(X_1, X_2) = X_1X_2 + X_2X_1 + X_1^2. \]

It is easy to check that

\[ L_p = \begin{pmatrix} 0 & X_1 & \frac{1}{2}X_1 + X_2 \\ X_1 & 0 & -1 \\ \frac{1}{2}X_1 + X_2 & -1 & 0 \end{pmatrix} \]

is a selfadjoint linearization of $p$ in the sense of Definition 3.1. Fig. 2 compares the result of our algorithm for $p(s, w)$ - where $s, w$ are free and $s$ is a semicircular and $w$ a free Poisson element - with random matrix simulations of the eigenvalues of $p(X, W)$, where $X$ is a 4000 x 4000 Gaussian random matrix and $W$ is a 4000 x 4000 Wishart random matrix, both chosen independently.
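The random matrix side of such a comparison can be simulated along the following lines. The normalizations are assumptions on our part, since the text does not spell them out: $X$ is a complex Hermitian Gaussian matrix scaled so that its limit is the standard semicircle law, and $W = GG^*/n$ with a square complex Gaussian $G$, so that its limit is the free Poisson law with rate 1. A smaller size than 4000 is used to keep the run short.

```python
import numpy as np

def gaussian_matrix(rng, n):
    """Hermitian Gaussian matrix whose eigenvalue distribution approaches the standard semicircle law."""
    g = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
    return (g + g.conj().T) / np.sqrt(4 * n)

def wishart_matrix(rng, n):
    """W = G G^* / n with G an n x n complex Gaussian matrix with unit-variance entries."""
    g = (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))) / np.sqrt(2)
    return g @ g.conj().T / n

rng = np.random.default_rng(7)
n = 200
X, W = gaussian_matrix(rng, n), wishart_matrix(rng, n)

p = X @ W + W @ X + X @ X            # p(X, W) = XW + WX + X^2, a Hermitian matrix
eigenvalues = np.linalg.eigvalsh(p)

# Normalized histogram, to be compared with the density produced by the algorithm
hist, edges = np.histogram(eigenvalues, bins=40, density=True)
print(eigenvalues.mean(), eigenvalues.min(), eigenvalues.max())
```

With these normalizations the mean eigenvalue should be close to $\phi(s^2) + 2\phi(s)\phi(w) = 1$, which gives a quick sanity check before comparing histograms.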



Figure 2. Normalized eigenvalue histogram for one realization of $p(X_1, X_2) = X_1X_2 + X_2X_1 + X_1^2$ for one independent realization of a 4000 x 4000 Gaussian random matrix $X_1$ and a 4000 x 4000 Wishart matrix $X_2$, compared to the density of the distribution of $p(s, w)$ for a semicircular element $s$ and a free Poisson element $w$, which are free.

Example 5.3. We consider the polynomial

\[ p(X_1, X_2, X_3) = X_1X_2X_1 + X_2X_3X_2 + X_3X_1X_3. \]

It is not hard to see that

\[ L_p = \begin{pmatrix} 0 & 0 & X_1 & 0 & X_2 & 0 & X_3 \\ 0 & X_2 & -1 & 0 & 0 & 0 & 0 \\ X_1 & -1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & X_3 & -1 & 0 & 0 \\ X_2 & 0 & 0 & -1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & X_1 & -1 \\ X_3 & 0 & 0 & 0 & 0 & -1 & 0 \end{pmatrix} \]

gives a selfadjoint linearization of $p$ in the sense of Definition 3.1. If we consider independent Gaussian random matrices $X_1^{(n)}$ and $X_2^{(n)}$ and Wishart random matrices $X_3^{(n)}$, the limiting eigenvalue distribution is given by the distribution of $p(s_1, s_2, w)$, where $s_1, s_2, w$ are free elements in some C*-probability space $(\mathcal{A}, \phi)$, such that $s_1$ and $s_2$ are semicircular elements and $w$ is a free Poisson. Fig. 3 compares the result of our algorithm for $p(s_1, s_2, w)$ with the eigenvalues of a random matrix simulation for $n = 4000$.